Group Representation Theory, eds. M. Geek, D. Testerman, J. Thevenaz, 
EPFL Press, Lausanne 2007, pp. 405-450 



Bounds for the orders of the finite subgroups of G(k) 



Jean-Pierre SERRE 



Introduction 

The present text reproduces - with a number of additions - a series of three 
two-hour lectures given at the Ecole Polytechnique Federale de Lausanne 
(E.P.F.L.) on May 25-26-27, 2005. 

The starting point is a classical result of Minkowski, dating from 1887, 
which gives a multiplicative upper bound for the orders of the finite sub- 
groups of GL n (Q). The method can easily be extended to other algebraic 
groups than GL n , and the field Q can be replaced by any number field. 
What is less obvious is that: 

a) one can work over an arbitrary ground field; 

b) in most cases one may construct examples showing that the bound 
thus obtained is optimal. 

This is what I explain in the lectures. 

Lecture I is historical: Minkowski (§1), Schur (§2), Blichfeldt and others 
(§3). The results it describes are mostly well-known, so that I did not feel 
compelled to give complete proofs. 

Lecture II gives upper bounds for the order of a finite f-subgroup of G(k), 
where G is a reductive group over a field k, and I is a prime number. These 
bounds depend on G via its root system, and on k via the size of the Galois 
group of its £-cyclotomic tower (§4). One of these bounds (called here the 
S-bound, cf. §5) is a bit crude but is easy to prove and to apply. The 
second one (called the M-bound) is the most interesting one (§6). Its proof 
follows Minkowski's method, combined with Chebotarev's density theorem 
(for schemes of any dimension, not merely dimension 1); it has a curious 
cohomological generalization cf. §6.8. The last subsection (§6.9) mentions 
some related problems, not on semisimple groups, but on Cremona groups; 
for instance: does the field Q(X,Y,Z) have an automorphism of order 11 ? 

Lecture III gives the construction of "optimal" large subgroups. The case 
of the classical groups (§9) is not difficult. Exceptional groups such as E% 
are a different matter; to handle them, we shall use Galois twists, braid 
groups and Tits groups, cf. §§10-12. 
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I. History: Minkowski, Schur, ... 

§1. Minkowski 

Reference: [Mi 87]. 

1.1. Statements. We shall use the following notation: 

£ is a fixed prime number; when we need other primes we usually denote 
them by p; 

the ^-adic valuation of a rational number x is denoted by vg(x); one has 
vi(£) = 1, and v^{x) = if x is an integer with (x,£) = 1; 

the number of elements of a finite set A is denoted by \A\; we write vi{A) 
instead of t^(|A|); if A is a group, £ vl{A) is the order of an ^-Sylow of A; 
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if x is a real number, its integral part ("floor") is denoted by [x]. 
We may now state Minkowski's theorem ([Mi 87]): 

Theorem 1. Let n be an integer > 1, and let £ be a prime number. Define: 



M(n,£) 



n 



+ 



n 



l(£-\) 



+ 



n 



£ 2 (£-l)_ 



+ ... 



Then: 

(i) If A is a finite subgroup o/GL n (Q), we have vi(A) < M(n,£). 

(ii) There exists a finite £-subgroup A o/GL n (Q) with ve(A) = M(n,£). 

The proof will be given in §1.3 and §1.4. 

Remarks. 

1) Let us define an integer M(n) by: 



M(n) 



I 



Part (i) of th.l says that the order of any finite subgroup of GL n (Q) di- 
vides M(n), and part (ii) says that M(n) is the smallest integer having this 
property. Hence M(n) is a sharp multiplicative bound for \A\. 
Here are the values of M{n) for n < 8: 

M (1) = 2 

M(2) = 2 3 - 3 = 24 

M(3) = 2 4 - 3 = 48 

M(4) = 2 7 -3 2 -5 = 5760 

M(5) = 2 8 - 3 2 - 5 = 11520 

M(6) = 2 10 - 3 4 - 5 • 7 = 2903040 

M (7) = 2 11 • 3 4 • 5 • 7 = 5806080 

M(8) = 2 15 - 3 5 - 5 2 - 7 = 1393459200. 

Note that 



M(n)/M(n - 1) = < 



'2 if n is odd 

denominator of b n /n if n is even, 



where b n is the n-th Bernoulli number. (The occurence of the Bernoulli 
numbers is natural in view of the mass formulae which Minkowski had proved 
a few years before.) 

2) One may ask whether there is a finite subgroup A of GL n (Q) of order 
M(n). It is so for n = 1 and n = 3 and probably for no other value of n (as 
Burnside already remarked on p. 484 of [Bu 11]). Indeed, some incomplete 
arguments of Weisfeiler and Feit would imply that the upper bound of \A\ 
is 2 n • n\ if n > 10, which is much smaller than M(n). See the comments of 
Guralnick-Lorenz in [GL 06], §6.1. 



Exercise. Let 



— a i£ l i < Oj < ^ — 1, be the ^-adic expansion of 



e-i 



Show that M{n,l) = Y,ai^=i k = E M{a i l i {£ - 1),£). 
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1.2. Minkowski's lemma. Minkowski's paper starts with the following 
often quoted lemma: 

Lemma 1. If m > 3, the kernel o/GL n (Z) — > GL n (Z/mZ) is torsion free. 

Proof. Easy exercise ! One may deduce it from general results on formal 
groups over local rings, cf. Bourbaki [LIE III], §7. Many variants exist. For 
instance: 

Lemma 1'. Let R be a local ring with maximal ideal m and residue field 
k = R/m. If i is a prime number distinct from char(£i), the kernel of the 
map GL„(i?) —■ GL„(/e) does not contain any element of order I. 

Proof. Suppose x € GL n (i?) has order i and gives 1 in GL„(A;). Write 
x = 1 + y; all the coefficients of the matrix y belong to m. Since x l = 1, we 
have 

which we may write as y ■ u = 0, with u = i + ( 2 )y + • • • + y^~ l - The image 
of u in GL n (fc) is I, which is invertible. Hence u is invertible, and since y ■ u 
is 0, this shows that y = 0. □ 

Several other variants can be found in [SZ 96]. 

Remark. A nice consequence of lemma 1' is the following result of Malcev 
and Selberg ([Bo 69], §17): 

(*) Let r be a finitely generated subgroup of GL n (K), where K is a field 
of characteristic 0. Then T has a torsion free subgroup of finite index. 

Sketch of proof (for more details, see Borel, loc.cit.). Let S be a finite 
generating subset of T, and let L be the ring generated by the coefficients 
of the elements of S U S -1 . We have T C GL„(L). Let m be a maximal 
ideal of L; the residue field k = A/m is finite ([AC V], p. 68, cor.l to th.3); 
let p be its characteristic. The kernel ri of T — » GL n (fc) has finite index in 
T; by lemma V (applied to the local ring R = L m ), T\ does not have any 
torsion except possibly p-torsion. By choosing another maximal ideal of L, 
with a different residue characteristic, one gets a torsion free subgroup of 
finite index of Ti, and hence of F. □ 

Remark. When K has characteristic p > the same proof shows that T 
has a subgroup of finite index which is "p'-torsion free", i.e. such that its 
elements of finite order have order a power of p. 

1.3. Proof of theorem 1 (i). Let A be a finite subgroup of GL n (Q); we 

have to show that vg{A) < M(n,£). Note first: 

1.3.1. The group A is conjugate to a subgroup o/GL n (Z). 

This amounts to saying that there exists an yl-stable lattice in Q n , which 
is clear: just take the lattice generated by the A-transforms of the standard 
lattice Z n . 
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1.3.2. There is a positive definite quadratic form on Q n , with integral coef- 
ficients, which is invariant by A. 

Same argument: take the sum of the ^4-transforms of x\ + • • • + x^, and 
multiply it by a suitable non-zero integer, in order to cancel any denomina- 
tor. 

Let us now proceed with the proof of V(,(A) < M(n,£). We do it in two 
steps: 

1.3.3. The case £ > 2. 

By 1.3.1, we may assume that A is contained in GL n (Z). Let p be a 
prime number 7^ 2. By lemma 1, the map A — > GL„(Z/pZ) is injective. 
Hence 

v t (A) < a(p) = v e (GL n (Z/ P Z)). 

The order of GL n (Z/pZ) is p n ( n - 1 )/ 2 (p - \){p 2 - 1) . . . (p n - 1). Let us 
assume that p 7^ I. Then we have 



»(p) = ^v l {p i - 1). 



We now choose p in such a way that a{p) is as small as possible. More 
precisely, we choose p such that: 

(*) The image of p in (Z/£ 2 Z)* is a generator of that group. 

This is possible by Dirichlet's theorem on the existence of primes in 
arithmetic progressions (of course, one should also observe that (Z/^ 2 Z)* 
is cyclic.) 

Once p is chosen in that way, then p % — 1 is divisible by i only if i is 
divisible by £ — 1; moreover, one has vg(p — 1) = 1 because of (*), and 
this implies that vi(p l — 1) = l + vi(i) if i is divisible by £ — 1. (This is where 
the hypothesis £ > 2 is used.) One can then compute a(p) by the formula 

above. The number of indices i < n which are divisible by i — 1 is 

We thus get: 



a(p) 



n 



£-1 



n 



l-l 



n 



n 



+ 



At- 1 ). 



+ 



1- 1 

M(n,£). 



n 



£-1 







This proves th.l (i) in the case £ / 2. 



1.3.4. The case £ = 2. 

When £ = 2, the method above does not give the right bound as soon 
as n > 1. One needs to replace GL ra by an orthogonal group. Indeed, by 
1.3.1 and 1.3.2, we may assume, not only that A is contained in GL n (Z), 
but also that it is contained in the orthogonal group O n (q), where q is 
a non-degenerate quadratic form with integral coefficients. Let D be the 
discriminant of q, and let us choose a prime number p > 2 which does not 
divide D. The image of A in GL n (Z/pZ) is contained in the orthogonal 
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group O n (Z/pZ) relative to the reduction of q mod p. If we put r = [n/2], 
the order of O n (Z/pZ) is known to be: 



2 -p r \p A - l)(p 4 - 1) . . . (p 2r - 1) if n is odd. 



and 



2 • p^-^Cp 2 - l)(p 4 - 1) . . . (p 2r - l)/(p r + e) if n is even, 



with e = ±1 equal to the Legendre symbol at p of (— l) r D. 

If we choose p = ±3 (mod 8), we have V2(p 2t — 1) = 3 + V2(i), and 
U2(p r + e) > 1. If n is odd, this gives 







r" 




+ 




.2. 


.4. 



v 2 (O n (Z/pZ)) = 1 + 3r + u 2 (r!) = n + r + 
and, if n is even: 

«2(O n (Z/pZ)) < 3r + «2(r!) = M(n, 2) 
Hence i>2(^4) is at most equal to M(n, 2). 



+ 



M(n,2), 



□ 



Remark. There are several ways of writing down this proof. For instance: 

- There is no need to embed A in GL n (Z). It sits in GL„(Z[l/iV]) for a 
suitable N > 1, and this allows us to reduce mod p for all p's not dividing 
N. 

- Minkowski's lemma is not needed either: we could replace it by the 
trivial fact that a matrix which is different from 1 is not congruent to 1 
mo dp for all large enough p's. 

- Even when £ > 2, we could have worked in O n instead of GL„; that is 
what Minkowski does. 

- When £ = 2 the case n even can be reduced to the case n odd by 
observing that, if A C GL n (Q), then A x {±1} embeds into GL n+ i(Q), 
and M(n + 1, 2) is equal to 1 + M(n, 2). 



1.4. Proof of theorem 1 (ii). The symmetric group Sg has a faithful 
representation Sg — > GL(Vi) where V\ is a Q-vector space of dimension 

I -I. Put r = 



, and let V = V\ © • • • © V r be the direct sum of r copies 

of V\. Let S* be the semi-direct product of S r with the product (Si) 1 " of r 
copies of Se ("wreath product"). The group S has a natural, and faithful, 
action on V. We may thus view S as a subgroup of GL r ^_ 1 ^(Q), hence also 
of GL„(Q), since n > r(£ — 1). We have 



v e (S) = r + ve(r\) 



n 



+ 



n 



l(t-\) 



+ 



M(n,£). 



An £-Sylow A of S* satisfies the conditions of th.l (ii). □ 

Example. When £ = 2 the group S defined above is the "hyper-octahedral 
group" , i.e. the group of automorphisms of an n-cube (= the Weyl group of 
a root system of type B n ); in ATLAS notation, it may be written as 2 n • S n . 
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1.5. A conjugacy theorem. The finite ^-subgroups of GL n (Q) have the 
following Sylow-like property: 

Theorem 1'. Let A and A' be two finite (-subgroups o/GL n (Q). Assume 
that A has the maximal order allowed by th.l. Then A' is conjugate to a 
subgroup of A. 

Corollary. If\A\ = \A'\ = £ M (™< £ ) , then A and A' are conjugate in GL n (Q). 

Proof of theorem 1'. See Bourbaki, [LIE III], §7, exerc.6 f) where only the 
case I > 2 is given, and Feit [Fe 97] who does the case i = 2. Let us sketch 
Bourbaki's method (which we shall use in §6.6 in a more general setting): 

We may assume that A and A' are contained in GL„(Z). Choose a 
prime p as in 1.3.3, and reduce mod p. The groups A and A' then become 
^-subgroups of G p = GL n (Z/pZ), and A is an ^-Sylow of G p . By Sylow's 
theorem applied to G p , one finds an injection i : A' —> A which is induced 
by an inner automorphism of G p . The two linear representations of A': 

A' -»• GL n (Q) and A' 4 A -»• GL n (Q) 

become isomorphic after reduction mod p. Since p ^ £, & standard argument 
shows that they are isomorphic over Q, which proves th.l' in that case. The 
case I = 2 can be handled by a similar, but more complicated, argument: if 
n is odd, one uses orthogonal groups as in 1.3.4, and one reduces the case n 
even to the case n odd by the trick mentioned at the end of §1.3. □ 

Exercise. Let A(n) be a maximal 2-subgroup of GL„(Q). Show that the A(n)'s 
can be characterized by the following three properties: 

A(l) = {±1}. 

A(2n) = (A(n) x A(n)) ■ {±1} (wreath product) if n is a power of 2. 
A(n) = A{2 mi ) x ■■■ x A(2 m ")tfn = 2 mi -\ h 2™" with m x < ■ ■ ■ < m k . 



§2. Schur 

Ten years after [Mi 87], Frobenius founded the theory of characters of 
finite groups. It was then (and still is now) very tempting to use that theory 
to give a different proof of Minkowski's results. The first people to do so 
were Schur ([Sch 05]) and Burnside ([Bu 11], Note G). Schur 's paper is 
especially interesting. He works first over Q, as Minkowski did, and uses a 
very original argument in character theory, see §2.1 below. He then attacks 
the case of an arbitrary number field, where he gets a complete answer, see 
§2.2. 

2.1. Finite linear groups with rational trace. What Schur proves in §1 
of [Sch 05] is: 

Theorem 2. Let A be a finite l-subgroup o/GL n (C). Assume that the traces 
of the elements of A lie in Q. Then v^(A) < M(n,£), where M(n,£) is as 
in th.l. 
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The condition on the traces is obviously satisfied if A is contained in 
GL n (Q). Hence th.2 is a generalization of th.l. (As a matter of fact, it is a 
genuine generalization only when I = 2; indeed, when I > 2, it is known, cf. 
[Ro 58], that a finite ^-subgroup of GL n (C) with rational trace is conjugate 
to a subgroup of GL„(Q).) 

Proof. We start from the following general fact, which is implicit in [Sch 05] 
(and is sometimes called "Blichfeldt's lemma"): 

Proposition 1. Let G be a finite subgroup o/GL n (C) and let X be the sub- 
set of C made up of the elements Tr(g) for g G G, g ^ 1. Let N = Y[( n ~ x ) 
be the product of the n — x, for x £ X. Then N is a non-zero integer which 
is divisible by \G\. 

(Hence the knowledge of the set X gives a multiplicative bound for the 
order of G.) 

Proof. Let m = \G\, and let z be a primitive m-th root of unity. The 
elements of X are sums of powers of z; hence they belong to the ring of 
integers of the cyclotomic field K = Q(z). This already shows that N is an 
algebraic integer. If s is an element of Gal(A/Q), one has s(z) = z a for some 
a £ (Z/mZ)*. If x = Tr(<?), with g £ G, then s(x) = Tr(g a ), hence s(x) 
belongs to X. This shows that X is stable under the action of Gal(A/Q); 
hence N is fixed by Gal(A/Q); this proves that N belongs to Z. 

The factors of N are / 0. Indeed, Tr(g) is equal to the sum of n complex 
numbers zi with \zi\ = 1, hence can be equal to n only if all the Zi are equal 
to 1, which is impossible since g ^ 1. This shows that N / (one could 
also prove that A is positive, but we shall not need it). 

It remains to see that N is divisible by |G|. It is well-known that, if % is 
a generalized character of G, the sum J2 g( z G x{g) is divisible by |G|. Let us 
apply this to the function g h-> x{q) = FLeX ( Tr(g) — x) , which is a Z-linear 
combination of the characters g i— > Tr(g) m ,m > 0. Since x(<?) = f° r 9^1 
and x(l) = A, the sum of the x(<?) is equal to A. Hence A is divisible by 
\G\. ' □ 

The next lemma gives an information on the Tr(<7)'s: 

Lemma 2. Let A be as in th.2. If g € A, then Tr(g) may be written as 
n — ly with y € Z and < y < n/{i — 1). 

Proof. Each eigenvalue of g is of order t a for some a > 0, and all the 
eigenvalues with the same a have the same multiplicity. By splitting C n 
according to the a's, one is reduced to the following three cases: 

(1) g = 1 and n = 1. Here Tr(#) = 1 and we take y = 0. 

(2) g has order I and n = t — 1. Here Tr(g) = —1, and y = 1. 

(3) g has order £ a with a > 1 and n = t a ~ x {l - 1). Here Tr(#) = and 
y = £ a ~ 2 (£ - 1). 

In each case we have < y < n/(£ — 1). □ 

End of the proof of theorem 2. We apply prop.l to G = A. By lemma 2, 
each factor n — x of N can be written as ly with 1 < y < d = [n/£ — 1)]. 
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This shows that N divides the product £ d ■ d\ and we have 

v e {N) <d + v e (d\) = [n/{£ - 1)] + [n/£{£ - 1)] + • • • = M(n, £). 
Since |G| divides N, this proves th.2. □ 

Remark. One may ask whether th.2 can be complemented by a conjugacy 
theorem analogous to th.l' of §1.5. The answer is of course "yes" if £ > 2 
(because of th.l'), but it is "no" for i = 2: the dihedral group D4 and 
the quaternion group Qs are non-conjugate 2-subgroups of GL2(C), with 
rational trace, which have the maximal order allowed by th.2, namely 8. 

2.2. Replacing Q by an arbitrary number field. This is what Schur 
does in §§2-6 of [Sch 05]. Before stating his result, some notation is neces- 
sary: 

- k is a number field, viewed as a subfield of C. 

- For each a > 1, z a denotes a primitive o-th root of unity. 

- (assuming I > 2). We put t = [k(ze) : k] and we denote by m the 
maximal a such that k(zi) contains z ta (this notation coincides with Schur's, 
and it will be extended to arbitrary fields in §4 of Lect.II). We put 



M k (n,£) = m 



~n~ 




' n ' 




" n ' 


1. 


+ 


It. 


+ 


J 2 t\ 



+ 



- (assuming £ = 2). We put t = [k(i) : k] and we define m as explained in 
§4.2 (warning: t and m do not always coincide with Schur's ti and 7712). We 
put: 

M k (n,2) =n+(m'-l) [-J + [-J + [-J +..., 

where m! is equal to m + 1 in case (b) of §4.2 and is equal to m in the other 
cases. 

The main result of [Sch 05] is: 

Theorem 2'. Let A be a finite £- subgroup o/GL n (C) such thatTr(g) belongs 
to k for every g € A. Then ve(A) < Mk(n,£). 

Note that, when k = Q, the integer M^(n, £) is equal to Minkowski's M(n, £); 
hence th.2' is a generalization of th.2. 

Proof. I shall not give all the details of Schur's proof, but just explain its 
main steps. For more information, see [Sch 05] (and also [GL 06] for the 

case £ > 2). 

One of the inputs of the proof is the following result, which had just been 
proved by Blichfeldt ([Bl 04] - see also §3 below): 

2.2.1. Every linear representation of A is monomial. 

Hence one can decompose the vector space C™ as a direct sum of n lines 
D n which are permuted by A. This gives a homomorphism A — V S n \ 
its kernel A' is a normal abelian subgroup of A. Hence: 

2.2.2. The group A has a normal abelian subgroup A' such that [A : A') 
divides n\. 

This led Schur to investigate the case where A is abelian. He proved: 
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2.2.3. If A is as in th.2' , and is abelian, then : 

' m- [f] ifi > 2 

ve(A) < < 

(m'-i) • [f] +n if£ = 2. 

Sketch of proof. Since A is abelian, and the traces of its elements belong 
to k, it is conjugate to a subgroup of GL n (fc). Let R be the fc-subalgebra 
of M. n (k) generated by A. We may write R as a product Y[ Ki, where the 
Ki are cyclotomic extensions of k, of the form k(z (a .), with aj > 0. Let 
ni = [Ki : k\\ then ^rij < n. The image of A in if* is a cyclic group of 
order € % . If £ > 2, it is not difficult to see that a% < m • [y-] for every i. 

Adding up, we find Yl a i — m ' [f]> an d since ve(A) < J2 a i, we § e ^ t ne 
inequality (2.2.3). The case t = 2 is similar. □ 

Once this is done, the case I = 2 follows. Indeed (2.2.2) and (2.2.3) give 
^2(^4) < ^2(^4') + ^2(^0 <n + {m' — t) • [j] + ^2(^0' an d this is equivalent 
to V2{A) < Mk(n, 2). The case £ > 2 requires more work, cf. [Sch 05], §5.D 

Remarks 

1) The bound vi(A) < M^{n,tj is optimal; this is proved by the same 
explicit constructions as in §1.4, cf. [Sch 05], §6. 

2) As we already pointed out in §2.1, the hypothesis Ti{A) C k implies, 
when I > 2, that A is conjugate to a subgroup of GL n (/c). One may then 
use Minkowski's method, as will be explained in §6 for semisimple algebraic 
groups (of course GL n is not semisimple, but the method applies with almost 
no change - the invariant degrees di of §6 have to be replaced by 1, 2, . . . , n). 
The bound found in that way coincides with Schur's. 

For i = 2, if one does not assume that A can be embedded in GL n (fc), 
I do not see how to apply either Minkowski's method or the cohomological 
method of §6.8. This raises interesting questions. For instance, consider a 
finite subgroup A of E$(C), and suppose that the conjugacy classes of the 
elements of A are Q-rational. Is it true that V2(A) < 30, ^(^4) < 13, . . . , as 
would be the case if A were contained in the rational points of a Q-form of 
Ek, cf. §6.3.2 ? 

§3. Blichfeldt and others 

Blichfeldt's theorem (§3.1 below) has already been used in §2.2. The 
results of §3.3 will be applied in §5.4, in order to prove what I call the 
"S-bound". 

3.1. Blichfeldt's theorem. Recall that a finite group A is called super- 
solvable if it has a composition series 

1 = A) C Ai C • • • C A m = A 

where the Ai are normal in A (and not merely in Ai + ±) and the quotients 
Ai/Ai-i are cyclic. One has 

nilpotent =3- supersolvable =3- solvable. 

In particular, an £-group is supersolvable. 
One proves easily: 
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(*) If A is supersolvable and non abelian, there exists an abelian normal 
subgroup A' of A which is not contained in the center of A. 

Recall also that a linear representation V of a group A is called monomial 
if one can split V as a direct sum of lines which are permuted by A. When V 
is irreducible, this amounts to saying that V is induced by a 1-dimensional 
representation of a subgroup of A. 

We may now state Blichfeldt's theorem ([Bl 04], see also [Bu 11], §258): 

Theorem 3. Every complex linear representation of a supersolvable finite 
group is monomial. 

(As a matter of fact, Blichfeldt was only interested in the case where A is 
nilpotent.) 

Proof. The argument is now standard. We may assume that the given rep- 
resentation V is irrreducible and faithful. If A is abelian, we have dim V = 1 
and there is nothing to prove. If not, we choose A' as in (*) above, and we 
split V as V = (BV X , where x runs through the 1-dimensional characters of 
A', and V x is the corresponding eigenspace; let be a non-zero V x ; it is 
distinct from V (otherwise, A' would be central), and every non-zero V x is 
an A-transform of (because V is irreducible). Call B the subgroup of A 
stabilizing V^. We have A 1 C B C A, and \B\ < \A\. Using induction on 
\A\, we may assume that th.3 is true for B; this gives a splitting of as a 
direct sum of lines which are stable under B. By transforming them by A, 
we get the desired splitting of V. □ 

3.2. Borel-Serre. In [BS 53], Borel and I proved: 

Theorem 3'. Let G be a compact real Lie group, and let A be a finite 
supersolvable subgroup of G. There exists a maximal torus T of G which is 
normalized by A. 

Remark. When one applies th.3' to G = U n (C), one recovers th.3. Hence 
th.3' may be viewed as a generalization of Blichfeldt's theorem. 

Proof of theorem 3' (sketch). 

Lemma 3. Let q be a finite dimensional Lie algebra over a field of charac- 
teristic 0, and let s be an automorphism of prime order of g . If s has no 
fixed point ^ 0, then g is nilpotent. 

(Note the analogy with a - much deeper - theorem of Thompson 
[Th 60-64] : if a finite group G has an automorphism of prime order with no 
non-trivial fixed point, then G is nilpotent.) 

Proof of lemma 3. By extending scalars, we may assume that the ground 
field is algebraically closed. Let p be the order of s, and let z be a primitive 
p-th root of unity. Let Qi be the kernel of s — z % in g. We have 

= 0o ©0i © • • • ©0p-l , 

and the hypothesis made on s means that go = 0. One then shows that 
ad(x) p ~ 1 = for every x belonging to one of the g^'s. This implies that the 
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Killing form of g is 0, hence that g is solvable (Cartan's criterion). The fact 
that g is nilpotent follows easily. (For more details, see §4 of [BS 54].) □ 

Once this is done, th.3' is proved by an induction argument similar to the 
one used in the proof of Blichfeldt's theorem, cf. [BS 53], §3. □ 

3.3. Steinberg and Springer-Steinberg. We now come to the setting of 
linear algebraic groups. Let A; be a field, and let G be an algebraic group 
over k. We shall assume in what follows that G is linear and smooth over 
k; the connected component of the identity of G is denoted by G°. Recall 
that G is said to be reductive if it is connected and if its unipotent radical 
(over an algebraic closure of k) is trivial, cf. [Bo 91], §11.21. If k = C, 
such groups correspond (by a standard dictionary, cf. [Se 93], §5) to the 
connected compact Lie groups. [In the literature, a group G such that G° is 
reductive is sometimes called "reductive" ; this is reasonable in characteristic 
0, but not otherwise. Here we prefer that "reductive" implies "connected" .] 
Theorem 3' has the following analogue: 

Theorem 3". Let A be a finite supersolvable group of order prime to char(/c) 
and let G be a reductive group over k on which A acts by k- automorphisms. 
Then there exists a maximal torus T of G, defined over k, which is stable 
under the action of A. 

(When k = C, this is equivalent to th.3', thanks to the dictionary mentioned 
above.) 

Corollary. If A is a finite supersolvable subgroup of G(k), of order prime 
to char (A;), there is a maximal k-torus T of G whose normalizer N is such 
that A is contained in N(k). 

(Recall that, if X is a fc-variety, X(k) is the set of fc-points of X.) 

Proof of theorem 3". When k is algebraically closed, this is proved in [SS 
68], 1.5.16, with the help of several results from [St 68]. For an arbitrary 
field k, the same proof works with very little change. One starts with the 
following basic result of Steinberg ([St 68], th.7.2): 

Proposition 2. Assume k is algebraically closed. Let s : G — >■ G be a 
surjective homomorphism. Then there exists a Borel subgroup B of G such 
that s(B) = B. 

When s has finite order prime to char (A;), one can say much more: 

Proposition 3. Let s be an automorphism of G of finite order prime to 
char(fc), and let G s be the subgroup of G fixed by s. Then: 

a) The connected component of G s is reductive. 

b) One has dim G s > if G is not a torus. 

c) // k is algebraically closed, there exists a Borel subgroup B of G and a 
maximal torus T of B such that s(B) = B and s(T) = T. 

Proof (sketch). We may assume k is algebraically closed, since assertions a) 
and b) are "geometric". A proof of a) is given in [St 68], cor. 9. 4. A proof of 
c) is given in [SS 68], 1.2.9, as an application of prop. 2. Assertion b) follows 
from c) by the following method of Steinberg: one observes that a pair (B, T) 
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with B D T, determines canonically a homomorphism h : G m —> T (indeed 
B gives a basis of the root system of (G,T), and one takes for h twice the 
sum of the corresponding coroots). Moreover, h is non-trivial if G is not a 
torus. The canonicity of h implies that it is fixed by s. Hence G s contains 
lm(h). □ 

End of the proof of th.3" . By induction on \A\ + dimC When A = 1, one 
takes for T any maximal fe-torus of G; by a theorem of Grothendieck, there 
is such a torus (cf. [Bo 91], th.18.2). We may thus assume A ^ 1. In that 
case A contains a cyclic subgroup < s >, non-trivial, which is normal. We 
may also assume that G is semisimple and that A acts faithfully. Let G\ be 
the connected component of G s ; we have dimCi > 0, cf. prop. 3 b). The 
group A/ A' acts on G±. By the induction assumption, there is a maximal 
torus T\ of Gi, defined over k, which is stable under the action of A/A' , 
hence of A. Let G2 be the centralizer of T\ in G. It is a reductive group of 
the same rank as G. We have dim G2 < dim G, since T\ is not contained in 
the center of G. Moreover, G2 is stable under the action by A. By applying 
the induction assumption to the pair (G2 , A) we get a maximal A;-torus T of 
G2 which is ^4-stable. Since G2 and G have the same rank, T is a maximal 
torus of G. □ 



II. Upper bounds 

Let G be a reductive group over a field k, and let i be a prime number, 
different from char (A;). Let A be a finite subgroup of G(k). We want to give 
an upper bound for vg(A), in terms of invariants of G, k and I. We give 
two such bounds. The first one (§5) is less precise, but very easy to apply; 
we call it the S-bound (S for Schur). The other bound (§6) is the M-bound 
(M for Minkowski). Both bounds involve some cyclotomic invariants of k, 
which are defined in §4 below. 

§4. The invariants t and m 

4.0. Cyclotomic characters. Let k be an algebraic closure of k, and let 
k s be the separable closure of k in k. For each n > 1 prime to char(/c), let 
[i n C k* be the group of n-th roots of unity and let z n be a generator of \i n . 

The Galois group T/u — Gal(/c s //c) acts on (z n ) = fi n . This action defines 
a continuous homomorphism 

Xn :r fe ^Aut(/v) = (Z/nZ)*, 

which is called the n-th cyclotomic character of k. 

This applies in particular to n = i d (d = 0, 1, . . . ); by taking inverse limits 
we get the £°°-cyclotomic character 

Xtoo : T k -)■ Z| = Um(Z/^Z)*, 

where is the ring of ^-adic integers. What matters for us is the image 
Im x e oo, which is a closed subgroup of Z|. To discuss its structure, it is 
convenient to separate the cases £ 7^ 2 and 1 = 2. 
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4.1. The case £ ^ 2. We have 

Z £ * = Q_! x {l+£-Z,} 

where is cyclic of order £ — 1 (i.e. Q-i is the group of the 

^-adic field Q^; it is canonically isomorphic to F|). As for 1 + £-Z#, it is 
procyclic, generated by 1 + £, and isomorphic to the additive group Zf, its 
closed subgroups are the groups 1 + l d -Zi = (1 + £ d ), d = 1, 2, . . . , oo, with 
the convention l°° = 0. 

Since £ — 1 and £ are relatively prime, the subgroup Im x loc of Z\ decom- 
poses as a direct product: 

where t is a divisor of I — 1, Cj is cyclic of order £ and m = 1, 2, . . . or oo. 

Remark. An alternative definition of the invariants t and m is: 

t = [k(zi) : k] = fc-degree of zg 

m = upper bound of the d > 1 such that z d is contained in k(z£). 

Examples. If fc = Q or Q^, \ too is surjective and we have t = £—1, m = 1. If 
k = k s , then x £00 is trivial and i = 1, to = oo. If A; is finite with g elements, 
Im x e oo is the closed subgroup of Z\ generated by q and we have: 

t = order of q in F| 
m = - 1) = v e {q^ 1 - 1) . 

4.2. The case £ = 2. Here Z?j = C 2 x {1 + 4-Z 2 }, where C 2 = {1, -1} and 
the multiplicative group 1 + 4-Z 2 is isomorphic to the additive group Z 2 . 
There are three possibilities for Im x 200 : 

(a) Imx 2 oc = 1 + 2 m -Z 2 = (1 + 2 m ), with m = 2, . . . ,oo. We put t = 1. 

(b) Imx 2 oc = (-1 + 2 m ), with m = 2, . . . , oo. We put t = 2. 

(c) Imx 2 oc =C 2 x {l + 2 m -Z 2 } = (-l,l + 2 m ), m = 2,...,oo. We put 
t = 2. 

If m < oo, these types are distinct. If m = oo, types (b) and (c) coincide; 
in that case Im x 200 is equal to C 2 . 

Remark. We have t = [k(i) : k] with the usual notation i = Z4. Hence case 
(a) means that —1 is a square in k, and in that case m is the largest d > 2 
such that z , G k. 

2 d 

If t = 2, case (c) is characterized by the fact that —1 belongs to Imx 200 - 
As for m, it is given by: 

m = — 1 + upper bound of the d > 2 such that z^ d G in case (b) 

m = upper bound of the d > 2 such that z 2d G in case (c). 

Examples. If fc = Q or Q 2 , we have type (c) with t = 2,m = 2. If k = R, 
we have types (b) and (c) with m = 00. If /c is separably closed, we have 
type (a) with t = 1 and m = 00. 

When char(/c) 7^ 0, type (c) is impossible unless m = 00. If fc is finite 
with q elements, we have type (a) with m = v 2 (q — 1) if q = 1 (mod 4) and 
type (b) with m = v 2 (q + 1) if q = — 1 (mod 4). 
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4.3. The case of finitely generated fields. Let ko be the prime subfield 
of k, i.e. Q if char(/c) = or F p if char(fc) = p > 0. Suppose that k is finitely 
generated over ko. Then the invariant m is finite, i.e. Imx £OC , is infinite. 

Indeed, if not, there would be a finite extension k' of k containing the 
group jj, of all the £ d -th roots of unity (d = 1,2,...). Let K = fco(/i) be the 
extension of fco generated by [i. Then: 

(a) K is algebraic over ko 

(b) K is finitely generated over ko (because it is contained in k', cf. 
[A V], §14, cor. 3 to prop. 17). 

Hence K is either a finite field or a number field, which is absurd since 
such a field only contains finitely many roots of unity. 



§5. The S-bound 

We start with the case of tori: 

5.1. The S-bound for a torus: statements. 

Theorem 4. Let T be a torus over k, and let A be a finite subgroup ofT(k). 
Then 

TdimT 
vnA) < m — — — 

where m and t are defined as in §4 above and ip is Euler's totient function. 
The bound given by th.4 is optimal. More precisely: 

Theorem 4'. Assume m < oo. For every n > 1 there exist a k-torus T of 
dimension n and a finite subgroup A ofT(k) such that v^(A) = m - [n/<p(t)]. 

Example. Take k = Q and £ = 2, so that t = m = 2. Then th.4 says that 
any finite 2-subgroup of T(Q) has order < 4 dim T , and th.4' says that this 
bound can be attained. 

5.2. Proof of theorem 4. 

Lemma 4. Let u £ M n (Z£) be an n x n matrix with coefficients in 
which we view as an endomorphism of (Q^/Z^)". Then 

^(ker(w)) = det(u)) . 

Proof. This is clear if u is a diagonal matrix, and one reduces the general 
case to the diagonal one by multiplying u on the right and on the left by 
invertible matrices. □ 

Now let n be the dimension of the torus T. Let Y{T) = Hom^ (G m , T) be 
the group of cocharacters of T. The action of on Y(T) gives a homomor- 
phism p : T k -)• Aut (Y(T)) = GL n (Z). If we identify T with G m x • • • x G m 
(over k s ) by choosing a basis of Y(T), the ^°°-division points of T(k s ) form 
a group isomorphic to (Q^/Z^) n and the action of g G on that group is 
by p(g)x(g), where x = X^- 
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Lemma 5. Let A be a Unite subgroup of T(k). For every g E r& we have 
ve(A) < v e (det(p(g)x(g) -1)) =v £ {det(p(g- 1 )-x(.g))- 

Proof. By replacing A by its £-Sylow subgroup, we may assume that A is 
an ^-group, hence is contained in the ^-division points of T(k s ). Since the 
points of A are rational over k, they are fixed by g, i.e. they belong to 
the kernel of g — 1. The inequality then follows from lemma 4, applied to 

« = p(g) x(g) - !■ D 

We now choose g £ such that the inequality of lemma 5 gives that of 
th.4. Here is the choice: 

x(g) = z tu, where z t E has order t, and V((l— u) = m. 

(This works for £ = 2 as well as for £ ^ 2, thanks to the definition of t in 
§4.1 and §4.2. Note that in all cases but £ = 2, type (c), \{g) ls a topological 
generator of Imx-) 

We have p(g) £ GL n (Z), and p(g) is of finite order (because the image 
of p : Tfc — > GL„(Z) is finite). Hence the characteristic polynomial F of 
p{g~ l ) is a product of cyclotomic polynomials: 

(5.2.1) F = JJ with Vidj) = n. 

The inequality of lemma [5] gives 

Vi(A) <J2vi(<f> dj (z t u)). 
We thus need to compute Vi(<&d(ztu)) for every d > 1. The result is: 
Lemma 6. We have 



if d = t 

v e ($d(ztu)) = { 1 ifd = t-£ a ,a>l or a = -1 (if t = 2 = £) 
otherwise. 

Proof. (We restrict ourselves to the case £ ^ 2. The case £ = 2 is analogous 
but slightly different.) 

We have <&d( z t u ) = Y\( z t u ~ z ) w here z runs through the primitive d-th 
roots of unity in Q^. Write d as d = 5-£ a with (5, £) = 1 and a > 0. The 
images of the z's in the residue field of are primitive 5-th roots of 
unity. If 8 ^ t, none of them is equal to the image of ztu, which has order t. 
In that case, all the ZfU — z are units in hence have valuation and we 
have v^[^d{ z tu)) = 0. If 6 = t, i.e. d = t-£ a with a > 0, there are two cases: 

(a) a = 0, i.e. d = t. In that case, one of the z's is equal to zt and we 
have vi(ztu — z) = v^(u — 1) = m; the other z 7 s contribute 0. 

(b) a > 1. Here z can be written as z' ■ z" where z' runs through the 
i-th primitive roots of 1, and z" through the £ a -th primitive roots of 1. The 
valuation of z — ztu is unless z 1 = Zt, in which case vg{z — ztu) = v^(z" — u). 
It is well-known that Vf(z" — 1) = . Since vi(u — 1) = m, which is 
strictly larger, we have 

v e (z" -u) = v e ((z» - 1) - (u - 1)) = {£ _ = 
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Since the number of the z" is <p{( a ), we thus get ^(^(zju)) = 1, as 
claimed. □ 

We can now prove theorem 4: With the notation of (|5.2.ip . denote by r\ 
the number of j's with dj = t, and by r 2 the number of j's with dj = t-l a i , 
a.j > 1, or a.j = —1 in case £ = 2,t = 2. Using lemmas 5 and 6 we get 

ve(A) < rim + r 2 

and of course 

ncp(t) +^2tp(t-£ a i) <n = dimT. 

Since (p(t-£ a i) > tp(t)(£ - 1) this shows that nip(t) + r 2 ip(t)(£ - 1) < n. 
Hence r\ + r 2 (£ — 1) < [n/tp(t)], and we have: 

vi(A) < rim + r 2 < rim + r 2 {£ — l)m < m[n/cp(t)] , 

which concludes the proof. □ 

Remark. Since (£ — l)m > in all cases (even if £ = 2), the above proof 
shows that vi(A) can be equal to m[n/ip(t)] only when r 2 = 0. In other 
words: 

Complement to theorem 4. Assume Vi(A) = m[n/{p(k)], where 
n = dimT. If g £ T/% is such that x{d) = z tu, with Vi(u — 1) = m 
as above, the characteristic polynomial of p(g) is divisible by (&t) N > with 
N=[n/ip(k)]. 

(In other words, the primitive t-th. roots of unity are eigenvalues of p{g) with 
multiplicity N .) 

When t = 1 or 2 (i.e. when ip(t) = 1), this can be used to determine the 
structure of an "optimal" T: 

Corollary. Assume t = 1 or 2, and vi(A) = mn. Then : 

(i) If t = 1, the torus T is split (i.e. isomorphic to the product of n 
copies of G m ). 

(ii) Ift = 2, T is isomorphic to the product ofn non-split tori of dimension 
1 which are split by the quadratic extension k{z()/k if £ ^ 2 and by k(i)/k 
if £ = 2. 

Proof. We give the proof for t = 2 and £ > 2: the case t = 1 is easier and 
the case t = 2 = £ requires similar, but more detailed, arguments. 

Let 7 6 IV We may write x{l) as e 7 -n 7 , with e 7 G {1,-1} and 
ii 7 £ {1 + £ m 7j(}. There are three cases: 

(a) e 7 = —1 and ^(u 7 — 1) = m 

(b) e 7 = —1 and ^(u 7 — 1) > m 

(c) e 7 = 1. 

In case (a), the "complement" above shows that ^(7) has —1 for eigenvalue 
with multiplicity n, hence p{^f) = —1 in Aut(T) ~ GL n (Z). 

In case (b), choose g £ of type (a); this is possible by the very definition 
of t and m. The element g 2/ y is of type (a) (this uses the fact that £ is odd); 
hence we have p{g 2 'y) = — 1 and since p(g) = — 1 this shows that p{^) = — 1. 

If 7 is of type (c), then 57 is of type (a) or (b) and we have p(g~f) = — 1 
hence p{^) = 1. 
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In all cases, we have ,0(7) € {1,-1}, and more precisely ^(7) = e 7 . The 
corollary follows. □ 

It would be interesting to have a similar classification for t > 2. 

5.3. Proof of theorem 4': construction of tori with large A's. To 

prove th.4' it is enough to construct a fe-torus T, of dimension n = <p(t), such 
that T(k) contains a cyclic subgroup of order £ m . Here is the construction: 
Let K be the field k(z t ) if I ^ 2 and the field k(i) if t = 2. It is a 
cyclic extension of k of degree t with Galois group Ct- Let T\ = R K / k G rn 
be the torus: "multiplicative group of K"; we have T\{k) = K*, and T\{k) 
contains the group {z em }, cf. §4. If a is a generator of Ct, a acts on T\, 
and we have a 1 — 1 = in the ring End(Ti). Let us write the polynomial 
X 1 — 1 as ^t(X)-^(X), where <3?j is the t-th cyclotomic polynomial. We have 
$ t (cr)*(cr) = in End(Ti). Let T = Im*(cr) be the image of 

9(a) : Ti -> Ti. 

One checks that 

(a) dimT = 

(b) T(/c) contains . 

(For £ 7^ 2, (b) follows from the fact that the restriction of *(cj) to (z tm ) is 
an automorphism. For I = 2, use the fact that T is the kernel of $ t (cr).) 
Hence T has the required properties. □ 

Alternate description of T. It is enough to describe its character group 
T* = Hom fes (T, G m ), together with the action of T k on T*\ 

- T* = Z[X]/® t (X) = algebraic integers of the cyclotomic field Q(/t t ) 

- T k acts on T* by T k ->• Imx <00 C t ^ Aut (Q(/x t )). 

(It does not matter which isomorphism of Ct onto Aut (Q one chooses; 
they all give isomorphic tori.) 

5.4. The S-bound for reductive groups. Recall, cf. §3.3, that "reduc- 
tive" => "connected". 

Theorem 5. Let G be a reductive group over k, of rank r, with Weyl group 
W. If A is a finite subgroup of G(k), one has 



vt{A) < m 



+ v t (W). 



Proof. As usual, we may assume that A is an ^-group. In that case it is 
nilpotent, and by the corollary to th.3" of §3.3 there exists a maximal k- 
torus T of G whose normalizer ./V = Nq(T) contains A. Put Wt = N/T; 
this is a finite A:-group such that Wt(^s) — W. If At denotes the intersection 
of A with T(k), we have an exact sequence 

1 — > Ax — > A — > W T {k) . 

Hence ve(A) < V£(At) + By th.4, we have V(,(At) < m - [r/ip(t)]; 

on the other hand Wx(k) is isomorphic to a subgroup of W, hence t^(Wr(A;)) 
< ve(W). The theorem follows. □ 
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Corollary. If r < (p(t), then G(k) is l-torsion free (i.e. does not contain 
any elements of order €) . 



Proof. We have 



0. Hence by th.5 it is enough to show that 



vi(W) = 0, but this follows from th.l of §1.1 since W is isomorphic to 
a subgroup of GL r (Z) and r < tp(t) < t < £ — 1. □ 

Remark. The "S-bound" given by th.5 looks a priori rather coarse: 

(a) The torus T is not an arbitrary torus of dimension r; the fact that it 
is a subtorus of G puts non-trivial conditions on it; for instance the action 
of Tfc on T* = Honi£. s (T, G m ) stabilizes the set of roots. 

(b) The group Wx(k) is in general smaller than W itself, and the image 
of N(k) — >■ Wx(k) may be even smaller. 

It is therefore surprising how often the S-bound is close to being optimal. 
As an example, take k = Q and G of type E%. We have m = 1 and t = t — 1 
(except when t = 2 in which case m = t = 2), r = 8, \W\ = 2 14 3 5 5 2 7. The 
S-bound tells us that, if A is a finite subgroup of G(Q), its order divides the 
number 

M s = 2 30 -3 13 -5 6 -7 5 -13 2 -17-19 - 31. 

We shall see later (cf. §6.3.2 and §7) that the best bound is M = Ms/5-7-17 . 
In particular, the ^-factors of Ms are optimal for all €s except t = 5,7 and 
17. 



§6. The M-bound 

6.1. Notation. From now on, G is a semisimple Q group over k. We denote 
by R its root system (over k s ), by W its Weyl group, and by r its rank. 
The group W has a natural linear representation of degree r. The invari- 
ants of W acting on Q[xi, . . . , x r ] make up a graded polynomial algebra of 
the form Q[Pi, . . . ,P r ], where the Pi are homogeneous of degrees di, with 
d\ < c?2 < • • • < d r , (Shephard-Todd theorem, cf. e.g. [LIE V], §5, th.4 or 
[Se 00], p. 95). The d^s are called the invariant degrees of W (or of G). One 
has 

[Jdi = |W| and J2{2di - 1) = dimG. 

When G is quasi-simple (i.e. when R is irreducible) d r is equal to the Coxeter 
number h = (dimG)/r — 1, and one has the symmetry formula 

di + d r+ i-i = h + 2 . 

Moreover, if j < h is prime to h, then j+1 is one of the d^s. These properties 
make di, . . . , d r very easy to compute (see e.g. the tables of [LIE VI]). 

For instance, for G of type E 8 , the dj's are: 2, 8, 12, 14, 18, 20, 24, 30. 

Let Dyn(ii) be the Dynkin diagram of R. There is a natural action of the 
Galois group on Dyn(i?): this follows from the fact that Dyn(R) can be 
defined intrinsically from G/k s (cf- [LIE VIII], §4, no 4, Scholie, or [SGA 3], 

^We could also accept inner forms of reductive groups, for instance GL n or more 
generally GL_d, where D is a central simple fc-algebra with [D : k] = n 2 . In that case, one 
has r = n, the d;'s are the integers 1,2, ... ,n and th.6 is valid, with the same proof. 
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expose XXIV, §3, p. 344). In what follows (with the only exception of §6.7) 
we make the assumption that the action of on Dyn(i?) is trivial: one 
then says that G is of inner type (it can be obtained from a split group Go 
by a Galois twist coming from the adjoint group of Go). 

Examples of groups of inner type : 

- SL n , or more generally, SL^, where D is a central simple algebra over k. 
-Any group G whose root system has no non-trivial automorphism, e.g. any 
group of type Ai,B r ,C r , G 2 ,F 4 , E 7 , E 8 . 

6.2. Statement of the theorem. We fix £, k, and the root system R of 
G. Recall that Im^oo is a closed subgroup of Z|. Define: 

M(£, k, R) = inf v ^ " X ) = H E v *lX*~ ~ 1) • 

This is either an integer > or oo (it is oo if and only if the invariants m, t 
of k defined in §4 are such that m = oo and t divides one of the d^s, see 
prop. 4 below). 

Theorem 6. Let A be a finite subgroup ofG(k). Then ve(A) < M (£,k, R). 
(Recall that G is semisimple of inner type, cf. §6.1.) 

This is what we call the "M-bound" for vi(A). It will be proved in §6.5 
below by a method similar to Minkowski's. We shall see in Lect. Ill that it 
is "optimal" except possibly in the case £ = 2, type (c) of §4.2. 

For computations, it is useful to write M(£, k, R) explicitly in terms of 
the invariants t and m of §4: 

Proposition 4. (1) If £ / 2 or if £ = 2, t = 1 (case (a)), one has 
M(£,k,R)= (m + v e {di)) 

i 

di= (modi) 

(2) If £ = 2 and t = 2 (cases (b) and (c)) , one has 

M (2, k, R)=n + mr + v 2 {W) , 

where ro (resp. r\) is the number of indices i such that di is even (resp. di 
is odd). 

Proof. Let us begin with the case £ ^ 2. One shows first that, if t\d, one has 
ve(x d — 1) > m + vg(d) for every x £ Imx,™- (This is easy, since x can be 
written as zu with z* = 1 and ve(u — 1) > m, hence x d — 1 = u d — 1.) 

This already shows that M(£,k,R) > J2t\d, ( m + To prove the 

opposite inequality, one chooses x G Im x eoo of the form zu with z of order 
t and V((u— 1) = m. One gets (1). 

The same argument works if £ = 2 and t = 1. If £ = 2 and t = 2, one has 

V2(x d — 1) > m + V2{d) if d is even 
v 2 (x d -l) > 1 if d is odd, 
for every x £ Im x 2DO • This gives: 

M(2,k,R)> ^ 1+ E ( m + v 2(rfi)) =ri + mr + w 2 (W). 

odd even 
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To get the opposite inequality, observe that x = — 1 + 2 m belongs to Imx 2CO 
and check that Yl v^{x 1 — 1) is equal to r\ + mr§ + V2(W) . □ 

Corollary. Let a(t) be the number of indices i such that di = (mod t). If 
a{t) = 0, then G(k) is i-torsion free. 

Indeed, if a(t) = 0, the sum occurring in prop. 4 is an empty sum, hence 
M(£, k, R) = and one applies th.6. □ 

6.3. Two examples: A\ and E%. We take k = Q, so that t = £ — 1 and 
m = 1 if £ > 2 and t = m = 2 if I = 2. 



6.3.1. Type A\. There is only one di, namely d\ = 2, and prop. 4 gives: 

'3 if £ = 2 
M(4Q,Ai) = < 1 if £ = 3 
if £ > 3 . 

V 

In other words, every finite subgroup of G(Q) has an order which divides 
2 3 -3. This bound is optimal in the following sense: 

(a) The split adjoint group PGL2(Q) contains both a subgroup of order 
3 and a dihedral subgroup of order 8 (but no subgroup of order 24) . 

(b) The simply connected group SLh(Q), where H is the standard quater- 
nion division algebra, contains a subgroup of order 24 which is isomorphic to 
SL2(Fa). However the split group SL^Q) does not contain any subgroup 
of order 8 (but it does contain cyclic subgroups of order 3 and 4). 

6.3.2. Type E 8 . If we define M{Q,E S ) as \[ f £ MM ' Eli) , prop.4 gives: 

M(Q,E S ) = 2 30 -3 13 -5 5 -7 4 -ll 2 -13 2 -19-31, see e.g. [Se 79], §3.3. 

By th.6, the order of every finite subgroup of G(Q) divides M(Q,E$). As 
we shall see in the next lecture, this multiplicative bound is optimal. 



6.4. A Chebot are v- style result. We need such a result in order to gen- 
eralize Minkowski's method of §1. 

Let L be a normal domain which is finitely generated over Z as a ring, 
and let k be its field of fractions. If d = dim(L) denotes the Krull dimension 
of L ([AC VIII], §1), one has (loc.cit., §2): 

d = 1 + tr.deg(fc/Q) if char(fc) = 

d = tr.deg(/c/F p ) if char(fe) = p>0. 

Let Specmax(L) be the set of the maximal ideals of L (= set of closed 
points of Spec(L)). If x G Specmax(L), the residue field k(x) = L/x is finite 
(see e.g. [AC V], p. 68, cor. 1). We put Nx = \k(x)\; it is the norm of x. 

When d = 0, L is a finite field, and Specmax(L) has only one element. If 
d > (e.g. when char (A;) = 0), then Specmax(L) is infinite. More precisely, 
the Dirichlet series z(s) = J2 x l/(Nx) s converges for Re(s) > d, and one 
has 



(6.4.1) z(s) ~ log (l/(s - d)) when s^d (with s > d) . 
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See [Se 65], §2.7, which only contains a sketch of proof; complete details 
(for a slightly weaker statement) can be found in [Pi 97], App. B 0; sec 
also [FW 84], pp.206-207. 

Let now n be an integer > 1 which is invertible in L (and hence in k). 
Let Xn '■ Tfc — > (Z/nZ)* denote the n-th cyclotomic character of k, cf. §4.0. 
As in §4, we shall be interested in Imxn C (Z/raZ)*. 

Theorem 7. Let c be an element of (Z/nZ)* ; and let X c be the set of all 
x £ Specmax(L) such that Nx = c (mod n). Then : 

a) If c ^ Im Xn, then X c = . 

b) If c & Im^n and d > 0, then X c is Zariski-dense in Specmax(L) (or 
in Spec(L), this amounts to the same). In particular, X c is infinite. 

A more concrete formulation of b) is that, for every non-zero / € L, there 
exists an x with / x and Nx = c (mod n). 

Example. Take L = Z[l/n]. Then Specmax(L) is the set of all prime 
numbers which do not divide n, and th.7 translates into Dirichlet's theorem 
on the existence of primes in arithmetic progressions. 

Proof of theorem 7. The group C = Im Xn is the Galois group of the cyclo- 
tomic extension k(z n )/k. Let L n be the integral closure of L in k{z n ). One 
checks by standard arguments that the ring extension L n /L is finite and 
etale. In geometric terms, Spec(L n ) — > Spec(L) is a finite etale covering. 
The group C acts freely on Spec(L„), with quotient Spec(L). For every 
closed point x of Spec(L), the Frobenius element cr x of x is a well-defined 
conjugacy class of C (hence an element of C since C is commutative). More- 
over, if we view C as a subgroup of (Z/nZ)*, cr x is the image of Nx in Z/nZ. 
This proves a). 

Suppose now that d > and that c belongs to C = Imxn- Let z c (s) be 
the Dirichlet series ^ l/(Nx) s , where the sum is over the elements x of X c . 
The general Chebotarev density theorem ([Se 65], [Pi 97]) gives: 

(6.4.2) z c (s) ~ t— 7 log(l/(s — d)) when s — > d with s>d. 

I I 

In particular, we have z c (d) = +oo. If the Zariski closure X c of X c were of 
dimension < d— 1, we would have z c {d) < oo, as one sees by splitting X c 
into irreducible components, and applying (6.4.1). Hence b). □ 

6.5. Proof of theorem 6. Let A C G(k) be as in th.6. We want to prove 
that 

v e (A) < M(£,k,R). 

We do it in three steps: 

2 When char(fc) = one can give a stronger statement, in the spirit of the Prime 
Number Theorem: 

For every X > 2, call ttl(X) the number of a; £ Specmax(L) such that Nx < X. Then: 

ttl(X) = (l/d)X d /logX + 0(X d /log 2 X) when X -> oo . 

The general Chebotarev density theorem can also be stated (and proved) in terms of such 
"natural" density (standard method: use Weil-Deligne estimates to reduce everything to 
the known case d — 1). 
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6.5.1. The case where k is finite. Put q = \k\. It is well-known that 

\G(k)\=q N ]l(q li -l), where N = \R\/2 = Y,(^-l)- 

This shows that v^{A) < ^2vi(q d ' — 1). The integer q, viewed as an element 
of Z| , is a topological generator of Im x eoa ■ Hence every element u of Im x e oo 
is an ^-adic limit of powers of q and this implies that ve(u d — 1) > ve(q d — 1) 
for every d > 1. Hence the lower bound which defines M(£, k, R) is equal to 
^2ve(q dl — 1); this proves th.6 in the case where k is finite. 

6.5.2. The case where k is finitely generated over its prime subfield. 

By 6.5.1, we may assume that k is infinite. We need a subring L of k, with 
field of fractions k, which has the following properties: 

(a) L is normal, finitely generated over Z and contains l/l. 

(b) G comes by base change from a semisimple group scheme G over L, 
in the sense of [SGA 3], XIX. 2.7. 

(c) A is contained in the group G(L) of the L-points of G. 

Lemma 7. There exists such an L. 

This is standard, see e.g. [EGA IV], §8.1 □ 

Let us now choose (L,G) with properties (a), (b) and (c). For every 
x G Specmax(L), the fiber G^. of G at x is a semisimple group over k(x), 
of type R. Moreover, the Dynkin diagram of G is finite etale over Spec(L), 
cf. [SGA 3], XXIV.3.2; since it is "constant" for the generic fiber (i.e. 
over k) it is constant everywhere; this shows that the G^. are of inner 
type. The inclusion map i : A — > G(L) gives for every x a homomorphism 
i x : A — > G(k(x)). Since i is injective, there is an open dense subset Xq of 
Specmax(L) such that i x is injective for all x £ Xq. We thus get: 

v e (A) < v e (G(K(x))) = J2v e ((Nx) dt - l) for all x (£ X , 

cf. 6.5.1. Let u be any element of Im^™ • By applying th.7 to the image of 
u in (Z/FZ)* with j = 1, 2, . . . , we find a sequence of points Xj of Xq such 
that lim Nxj = u in Z^ . We have: 

^'-l)=limV^((iVx/-l), 

J— >oo * — » 

and applying the formula above to each of the x^-'s we obtain 

ve(A) < ^2v e (u dl - 1) for every u 6 Imx <00 . 

This proves th.6 in the case 6.5.2. 

[Variant: One reduces the general case to the case where dim(L) = 1 by using 
Hilbcrt's irreducibility theorem, as explained in [Sc 81], p. 2; in the case dim(L) = 1, 
one can apply the standard Chebotarev theorem instead of the general one.] 

6.5.3. The general case. The same argument as for lemma 7 shows that 
G comes by base change from a semisimple group G' over a subfield k! 
of k which is finitely generated over the prime subfield of k (i.e. F p or 
Q). Moreover, one may assume (after enlarging k! if necessary) that A 
is contained in G'(k'). The Galois group Ty acts on the Dynkin diagram 
Dyn(i?) of G' (which is the same as the one of G). Let k" be the Galois 
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extension of k' corresponding to the kernel of IV — > Aut Dyn(i?). Since G 
is of inner type over k, the field k" is contained in k. By base change to k", 
G' gives a semisimple group G" which is of inner type and we may apply 
6.5.2 to (G",A). We get v e (A) < M(£,k",R). Since k" is contained in k, 
we have M(£,k",R) < M(£,k,R) : the group Imx^oo can only decrease by 
field extensions. Hence vz(A) < M(£,k,R). □ 

6.5.4. Remark. Surprisingly, the proof above does not really use the hy- 
pothesis that A is a subgroup of G(k). It uses only that A acts freely on 
G, viewed merely as a k- variety (and not as a homogeneous space); this is 
indeed enough to ensure that V((A) < v^(G(k)) when k is finite. Here is an 
example: take G = SL2, I = 2, k = Q; the M-bound is 3, which means that 
any finite 2-subgroup of SL2(Q) has order < 8. As was said in §6.3.1, there 
is in fact no subgroup of order 8 in SL2(Q). But one can make a cyclic 
group of order 8 act freely on the variety SL2: take for instance the group 
generated by the automorphism: 

'a b\ fd-c -c-d\ fO -l\ fa b\ f 1 T 

c 



d) ^ li^) (^1 I = U J [c dl[-l 1 



Hence, even in this bad-looking case, the M-bound can claim to be "op- 
timal" . 

6.6. An analogue of Sylow's theorem. 

Theorem 8. Let A and A 1 be two finite £-subgroups of G(k). Assume that 
vi(A) is equal to the M-bound M(£,k,R). Then there exists y € G(k) such 
that yA'y' 1 C A. 

Corollary. If both A and A' attain the M-bound, then they are geometrically 
conjugate (i.e. conjugate in G(k)). In particular, they are isomorphic. 

Proof. We may assume that k is finitely generated over its prime subfield. 
If it is finite, th.8 is just a special case of Sylow's theorem. Let us assume 
that k is infinite, and choose L, G as in §6.5.2 with A, A' C G(L). Let Y be 
the subscheme of G made up of the points y with yA'y^ 1 C A. Let X be 
the set of all x G Specmax(L) such that Nx, viewed as an element of Z», is 
of the form ztu with z t of order t and v^(u— 1) = m (note that m is finite, 
cf. §4.3). It follows from th.7, applied to n = £ m+1 , that X is Zariski-dense 
in Spec(L). If x € Specmax(L), the groups A and A' inject into G(k(x)) 
(this is an easy consequence of the hypothesis that £ is invertible in L). If 
moreover x belongs to X, then the same computation as in §5.2 shows that 
ve(G(n(x)) is equal to the M-bound, hence A is an £-Sylow of G(k(x)). By 
Sylow's theorem, this shows that A' is conjugate in G(k(x)) to a subgroup 
of A. In particular, the fiber at x of Y — > Spec(L) is non-empty. Since X 
is Zariski-dense, this implies that the generic fiber Y/ k of Y — s- Spec(L) is 
non-empty, i.e. that Y(k) is non-empty. □ 

Remark. One can show that Y is smooth over L, and hence that Y(k s ) / 
which is slightly more precise than Y(k) / 0. 



Exercise. Show that a family of polynomial equations with coefficients in Z has a 
solution in C if and only if it has a solution in Z/pZ for infinitely many p's. 
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6.7. Arbitrary semisimple algebraic groups. In the previous sections, 
we have assumed that G is of inner type, i.e. that the natural homomorphism 

e : T k -> Aut Dyn(P) 

is trivial. Let us now look briefly at the general case, where no hypotheses 
on e are made. In order to state the result which replaces th.6 we need to 
introduce the linear representations £d of T k defined as follows: 

Let S = Q[Pi, . . . , P r ] be the Q-algebra of VF-invariant polynomials, cf. 
§6.1. Let I = (Pi, ... ,P r ) be the augmentation ideal of S; put V = I/I 2 . 
The vector space V is of dimension r, and is graded; the dimension of its 
d-th component Vd is equal to the number of indices i with di = d. The 
group Aut Dyn(P) acts on S, V and the V^'s; by composing this action with 
e, we get for each d > a linear representation 

e d : T k -> Aut(Vd) . 

Theorem 6'. Let A be a Unite subgroup of G(k). Them 

ve(A)< inf YsMdetix^igf-Edig))) 
a e i k ~r 

d 

(The determinant is relative to the vector space Vd<8)Qe.) 

Proof (sketch). The method is the same as the one used for th.6. There are 
three steps: 

(1) Reduction to the case where k is finitely generated over its prime 
subfield; this is easy. 

(2) Reduction to the case where k is finite, via the general Chebotarev 
density theorem instead of th.7. 

(3) The case where k is finite. In that case, if q = \k\, and if a is the 
Frobenius generator of T^, one has (cf. e.g. [St 68] th. 11.16) 

vi (G(k)) = X>( det(q d - Ed(cj))) = £>( det(x^ (<j) d - e d (a))) 
d d 

hence the desired formula: 

(*) v e (A) < Mtet(x e ~ (g) d - e d (g))) 

d 

in the special case g = a. By applying this to the finite extensions of k, one 
sees that the inequality (*) is valid for all a™, n = 1, 2, . . . , and hence for all 
g € Tk, since the a n are dense in Tfc. □ 

Remark. One may also prove th.6' using ^-adic cohomology, cf. §6.8. 

Example. Take R of type A%, so that Aut Dyn(P) = {1, —1} and e may 
be viewed as a quadratic character of T^. The V^'s are of dimension 1 for 
d = 2, 3 and are otherwise. The action of Aut Dyn(P) on Vd is trivial for 
all d, except d = 3. Hence £2 = 1, £3 = £, and th.6' can be rewritten as: 

v e (A) < inf {v e ( Xeoc (g) 2 - 1) + v e (x tpo (gf - e(g)) } . 
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A similar result holds for the types A r (r > 2), D r (r odd) and Eq, with 2 
(resp. 3) replaced by the even c/j's (resp. the odd dj's). 

6.8. The cohomological method. Let us consider first the general sit- 
uation suggested in §6.5.4 where a finite group A acts freely on a quasi- 
projective k- variety X. As explained in [II 05], §7, one can then give an 
upper bound for vg(A) in terms of the action of on the etale cohomology 
of X. More precisely, let H l c (X) denote the i-th etale cohomology group 
of Xij ts , with proper support and coefficients Q^; it is a finite dimensional 
Q^-vector space which is for i > 2-dim(A). There is a natural action of 
Tk on Hl(X), and, for each g € Tk, one can define the "Lefschetz number" 
Ax (g) by the usual formula: 

AxG?) = J>iyTr( 5 |i^(A)) . 

i 

One has Ax{g) € Zg. Moreover: 
Theorem 6". v e (A) < mf ge r k v e (A x (g)) . 

Proof. See [II 05], §7, especially cor. 7.5. The proof follows the same pattern 
as the other proofs of the present §: one uses Chebotarev to reduce to the 
case where k is finite, in which case the result follows from the fact, due to 
Grothendieck, that, if a is the (geometric) Frobenius generator of T^, then 
Ax(cr) is equal to |A(fc)|, hence is divisible by \A\ since the action of A is 
free. (As in the proof of th.6', one applies this, not only to a but also to its 
powers a n , n > 0, and one uses the fact that the a n are dense in T/, .) □ 

If one applies th.6" to A C G(k), with A acting by left translations on 
X = G, one recovers th.6 and th.6', thanks to the known structure of the 
cohomology of G, cf. e.g. [SGA 4±], p. 230. 

6.9. The Cremona group: open problems. Recall that the Cremona 
group Cr r (k) is the group of A:- automorphisms of the field k(X\, . . . , X r ), i.e. 
the group of birational automorphisms (or "pseudo-automorphisms" , cf. [De 
70]) of the projective r-space over k. For r = 1, one has Cri(fc) = PGL2(&). 
Let us assume that r > 2. As explained in [De 70], Cr r is not an algebraic 
group, but looks like a kind of very large semisimple group of rank r (very 
large indeed: its "Weyl group" is the infinite group GL r (Z)). Not much is 
known about the finite subgroups of Cr r (k) beyond the classical case r = 2 
and k algebraically closed. Here is a question suggested by §5.1: 

- Is it true that Cr r (fc) has no ^-torsion if ip{t) > r? 
In the special case k = Q, r = 2 or 3, this amounts to: 

- Is it true that the fields Q(Xi,X2) and Q(Xi,X2,Xs) have no auto- 
morphism of prime order > 11? (Automorphisms of order 2, 3, 5 and 7 do 
exist.) 

It would be very interesting to attack these questions using cohomology, 
but I do not see how to do this. It is not even clear how to define coho- 
mological invariants of Cr r (C), since there is no natural topology on that 
group. Still, one would like to give a meaning to a sentence such as 

"Cr r (C) is connected for r > 1 and simply-connected for r > 2." 
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III. Construction of large subgroups 
§7. Statements 

We keep the notation of Lecture II: k, £, x e ao , t, m, .... We consider only 
semisimple groups over k with a root system R which is irreducible. The 
M-bound of §6.2 will be denoted by M(£, k, R); it only depends on the pair 
{£, k) via the invariants t and m, and on R via the degrees di, . . . , d r of W. 
We limit ourselves to the case m < oo; see §14 for the case m = oo. 

A pair (G, A), where G is of inner type with root system R, and A C G(k) 
is a finite group, will be called optimal if Vi(A) is equal to the M-bound 
M(£,k,R). (We could assume that A is an ^-group, but this would not be 
convenient for the constructions which follow.) Our goal is to prove: 

Theorem 9. If £ ^ 2, an optimal pair (G,A) exists (for any k, R). 

Theorem 10. If £ = 2, an optimal pair (G, A) exists i/Imx 2 oo is of type 
(a) or (b) in the sense o/§4.2 (i.e. if lmx 200 can be topologically generated 
by one element). 

Theorem 11. In the case 1 = 2 and type (c), there exists (G,A) with 

v 2 (A) = r m + v 2 (W) 
where ro is the number of indices i such that di is even. 

Note that here the M-bound is M(2, k,R) = r\ + r^m + V2(W) with 
7T = r — ro, cf. §6.2, prop. 4. Hence v 2 (A) differs from M(2,k,R) only by 
r\. In particular, A is optimal if r\ = 0. Hence: 

Corollary. // all the di's are even (i.e. if — 1 G W), then an optimal pair 
(G, A) exists for £ = 2 (and hence for all ^'s, thanks to th.9). 

This applies in particular to the exceptional types G 2 , F4, Ej and E%. 

Remarks. (1) The simplest case where the M-bound is not attained is k = Q, 
£ = 2, R of type A 2 , where m = 2, ro = 1, r = 2, the M-bound is 4, and it 
follows from [Sch 05] that v 2 (A) < 3 for every finite subgroup A of G(Q). 

(2) In Theorems 9, 10 and 11, no claim is made on the structure of G 
except that it is of inner type and that its root system is of type R. However, 
if one looks closely at the proofs given in the next sections, one sees that G 
can be chosen to have the following properties: 

- it is simply connected; 

- it splits over the cyclotomic field k(ze) if £ > 2, and over k(i) if £ = 2. 

Simple examples (such as k = Q, £ = 3, G of type G 2 ) show that it is not 
always possible to have G split over k. 

(3) If G is not chosen carefully, the group G(k) may not contain any large 
^-subgroup, even if k contains all the roots of unity. For instance, when R 
is of type A\ (resp. of type Eg) it is easy (resp. it is possible) to construct 
a pair (G, k) such that the only torsion elements of G(k) have order 1 or 2 
(resp. G(k) is torsion free). 

(4) The three theorems above are almost obvious if the characteristic is 
p / (especially Theorem 11 since type (c) never occurs!): one takes a 
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finite field ko contained in k which has the same invariants t and m (this 
is easily seen to be possible - if k is finitely generated over F p , one chooses 
the maximal finite subfield of k), and one takes for G the group deduced by 
base change from a split group Go over ko with root system R. If we choose 
for A the finite group Go(ko), it is clear from the way we got the M-bound 
that vi(A) = M(£, fc , R) = M(£, k, R), so that (G, A) is optimal. 

In what follows, we shall assume that char(fe) = 0. Note also that we 
could replace k by any subfield having the same invariants t and m, for 
instance the intersection of k with the field of £°°-roots of unity. We could 
thus assume that k is a cyclotomic number field, if needed. 

The proof of Theorem 9 will be given first for classical groups (§9), by 
explicit elementary constructions similar to those of Schur. The more inter- 
esting case of exceptional groups (§12) will use different methods, based on 
Galois twists (§10), Tits groups and braid groups (§11). The case of I = 2 
will be given in §13. The last section (§14) is about m = oo. 

§8. Arithmetic methods (k = Q) 

These methods are not strong enough to prove the statements of §7, but 
they give very interesting special cases. 

8.10. Euler characteristics. Here, the ground field is Q. One starts 
from a split simply connected group scheme G over Z (this makes sense, 
cf. |SGA 3j ). One may thus speak of the group T = G(Z) of the integral 
points of G. It is a discrete subgroup of G(R). Its Euler characteristic xCO 
( "caracteristique d'Euler-Poincare" in French) is well-defined (see [Se 71] 
and [Se 79]); it is a rational number. Moreover it is proved in [Ha 71] that 

(8.10.1) x{ r) = cf[\((l-d l ) = cfl^ 1 

i=l i=l 1 

where is the d-th Bernoulli number, £ is the zeta function and 
c= |W|/|Wk-| where Wk is the Weyl group of a maximal compact sub- 
group K of G(R). Assume that all di's are even (if not, all the terms in 
(|8.10.ip are zero). Using standard properties of Bernoulli numbers, one can 
check that the M-bound relative to i is M = vt( den (|C(1 ~~ <^))) ; where 
"den" means denominator. Hence, if I does not divide c, and does not di- 
vide the numerator of any |C(1 — di) (which is the case if i is a so-called 
regular prime), one sees that the denominator o/EP(r) is divisible by £ M . 
But a theorem of K. Brown [Br 74] shows that this is only possible if T con- 
tains a finite subgroup of order £ M . Hence we get an optimal pair (provided 
(c, I) = 1, and I is regular, say). 

Example. Take G of type Eg; here c = 3 3 -5, and the numerators of the 
|C(1 — di) do not cancel any denominator. Hence one obtains that a split 
Eg, contains an optimal A for all £ ^ 3,5, with the extra information that 
A can be found inside the group T = G(Z) - but no information on what it 
looks like! 
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8.11. Mass formulae. In [Gr 96j . B. Gross considers Q-forms of G such 
that G(R) is compact] he also requires another condition which guarantees 
that G has a smooth model over Z. This condition is fulfilled for types B, 
D, G2, F4 and E%. He then proves a mass formula a la Minkowski ( [Gr 96| . 
prop. 2. 2): 

E ^ = 11^(1-4) 

i=i 

where the A a are the Z-points of the smooth models of G over Z (taken up 
to conjugation). Each A a is finite. It is then clear that, if £ N is the £-th 
part of the denominator of Yii=i ^C(l ~~ (k), the ^-Sylow subgroup of one of 
the A a has order > £ N . If is equal to the Minkowski bound M (which 
happens if £ does not divide the numerator of any of the ^C(l ~~ di)), then 
such a Sylow has order £ M , and we get an optimal pair. Note that there is 
no extra factor "c" as in (j8.10.ip . This works very well for G2, F4, Eg (and 
some classical groups too, cf. |Gr 96 ]): 

G2 - Here the mass is \ ({— 1)C( — 5) = ^6337' ano ~ ^ * s obtained with just 
one A a , which turns out to be isomorphic to G2(F2). 

F4 - There are two A a 's and the mass formula is 

' + oi, J 7-0 = 4C(-i)C(-5)C(-7)C(-H) 



2 15 -3 6 -5 2 -7 2 12 -3 5 -7 2 -13 



16 

691 



2 i5 . 36 . 52 . 72 . 13 ■ 



Eg - Here the numerator is very large, but the denominator is exactly what 
is needed for the M-bound, namely: 

2 30 . 313 . 55 . 7 4 . 11 2 . 13 2 . ig . 31 _ 



§9. Proof of theorem 9 for classical groups 

Here £ ^ 2. Recall that Imx f0 o = C t x {1 + ^ m Z £ }, where m > 1 and t 
divides i — 1. The M-bound is 

M= J2 {m + v £ (di)). 

i 

di=0(mod t) 

We denote by K the field k(z() generated by a root of unity of order I. It 
is a cyclic extension of k, of degree t, with Galois group Ct- It contains z e , n 
but not z , 1 , cf. §4.1. 

£771+1 " O 

9.1. The groups An and A^. If is an integer > 1, we denote by A tv 
the subgroup of GLn(K) (where K = k(zg) as above) generated by the 
symmetric group Sn and the diagonal matrices whose entries are ^ m -th 
roots of unity (wreath product of Sn with a cyclic group of order £ m ). We 
have 



(9.1.1) 



v e (A N ) = mN + v e (N\). 
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The image of detx : An — > K* is {±1} x (z em ). Hence the kernel ^4^- is 
such that 

(9.1.2) v i (Aj f ) = m(N-l)+v i (N\). 

We are going to use An, and sometimes ^4^, in order to construct optimal 
subgroups for the classical groups SL„, SO„ and Sp n ; this is what Schur 
did in |Sch 05| . §6, for the case of GL„. 

9.2. The case of SL„. The d^s are 2, 3,. . ., n. If we put N = we have 

(9.2.1) M = mN + Vi(Nl) if*>2, 

(9.2.2) M = m(N - 1) + v e (N\) if t = 1, in which case N = n. 

In the case t > 2, we take An C G~Ln(K) C G~LNt(k), and observe that 
detfc(^4/v) is equal to ±1 (indeed, if g E An, then det&(<?) = N K / k (detK(9)) 
and one checks that N K / k (z pn ) = 1). This shows that an ^-Sylow of An is 
contained in ShNt(k) and hence in SL„(fc). By (|9.2.ip we get an optimal 
pair. 

In the case t = 1, we use the same construction with ^4^ instead of An- 
The comparison of (|9,1.2p and (|9.2.2p shows that we get an optimal pair. 

9.3. The case of the orthogonal and symplectic groups, t odd. Let 

us consider the case of Sp 2n . The d^s are equal to 2, 4, ... , In. Hence, if we 
put JV = [2] , the M-bound is mN + vg(N\). There is a natural embedding: 

GLat ->■ Sp 2N -)■ Sp 2n 

( x \ 

defined by x i-> I ^ t j J . The image of A^v by that embedding is optimal. 

The same construction works for S02n and SO^n+i- (Note that, in all 
these cases, we get the split forms of the groups of type B n , C n , D n . This 
is no longer true in the case t is even - nor in the cases of §12.) 

9.4. The case of the orthogonal and symplectic groups, t even. 

Since t is even, the group C% = Gal(K/k) contains an element a of order 
2; its image in ZJ is —1. Let Kq be the subfield of K fixed by a; we have 
[K : Kq] = 2, [Kq : k] = to with to = t/2. Moreover a(z em ) is equal to 
(^m) _1 ; i.e. a acts on z im just as complex conjugation does. Let us define 
an hermitian form h on K N (where N is a given integer > 1) by the standard 
formula 

N 

K x ,y) = ^ J x i -a(y i ), if x = (xi, . . .,x N ), V = {yi, ■ ■ -,Vn)- 

i=l 

If Ujv denotes the unitary group associated with h, it is clear that the group 
An defined in §9.1 is contained in XJn{K). [We use here the traditional 
notation XJn{K) for the unitary group; this is a bit misleading, since is 
an algebraic group over Kq, and we are taking its -Ko-points.] 

Let 5 € K* be such that a (6) = —5, e.g. 5 = Z£ — zj l . We have 
K = Kq © S-Kq, and h(x, y) can be decomposed as 

h(x, y) = qo{x, y) +5 ■ b (x, y), with q (x, y) G Kq, b (x, y) E K . 
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Then qo (resp. 60) is a non-degenerate symmetric (resp. alternating) K$- 
bilinear form of rank 2N . 

Its trace q = Tr Ko / k qo(x, y) (resp. b = Tix /k bo(x, y)) is of rank 2Nt$ = Nt 
over k. We thus get embeddings: 

(9.4.1) A N -»• V N (K) -> SO 2N (K ) -> SOrf) 

(9.4.2) Aiv Ujv(if) -> Sp 2JV (^ ) -> S Pjvt (fc). 

Now, for a given n, let us define iV by A?" = [^p] = 
an embedding 

Aat -> Sp^/c) -> Sp 2n (A;) ) 

and one checks that it is optimal. 

The same method gives an embedding of An into SOm(k), hence into 
SC>2n+i(&)) and this embedding is also optimal. As for S02 n (/c), one has to 
be more careful. The method does give an embedding of An into the S02« 
group relative to some quadratic form Q, but we have to ensure that such 
an SC>2n group is of inner type i.e. that disc(Q) = (-l) n in k*/k* 2 . There 
are three cases: 

a) If 2n > Nt (i.e. if t does not divide 2n), we choose Q = q ® qi, where 
qi has rank 2n — Nt, and is such that disc(g) • disc(gi) = (— l) n . We then 
have An C S0 2n) g(fc) and this is optimal. 

b) If 2n = Nt and N is even, we have disc(g) = d , where d = disc(ATo/fc), 
hence disc(q)= 1 in k*/k* 2 , which is the same as (— l) n since n is even. 

c) If 2n = Nt and N is odd, we use an optimal subgroup A of S0 2 „_i(fc) 
relative to a quadratic form qo of rank 2n — 1. By adding to go a suitable qua- 
dratic form of rank 1, we get a quadratic form of rank 2n and discriminant 
(— l) n , as wanted. The corresponding embedding 

A^S0 2 „-i(fc)^SCMA:) 

is optimal. (Note that the d^s for type D n are 2, 4, . . . , 2n — 2, and n. Hence, 
if i /fn, the M-bound for D n is the same as the M-bound for £?„_i.) 

§10. Galois twists 

To handle exceptional groups, we have to use twisted inner forms instead 
of split ones. We shall only need the most elementary case of twisting, 
namely the one coming from a homomorphism ip : — > Aut(G). Let us 
recall what this means (cf. for example [Se 64] . chapter III): 

Let K/k be a finite Galois extension. Let X be an algebraic variety over k, 
assumed to be quasi-projective (the case where X is affine would be enough). 
Choose a homomorphism 

tp : Gal(K/k) -> Aut fe X. 

The twist X v of X by 92 is a variety over k which can be characterized as 
follows: 

There is a A'-isomorphism Q:Xj^ — > X V ^ K such that ~f(0) = o 93(7) for 
every 7 € Gal(K/k). 



to 



By (9.4.2), we get 
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(Here X/ K denotes the K- variety deduced from X by the base change 
k — ?> K, and 99(7) € Aut^ X is viewed as belonging to Autx X/ K .) 

One shows (as a special case of Galois descent) that such a pair (X^,#) 
exists, and is unique, up to isomorphism. 

It is sometimes convenient to identify the iT-points of X and X^ via the 
isomorphism 9. But one should note that this is not compatible with the 
natural action of Gal(K/k) on X(K) and X^K); one has 

7(0(x)) = ¥>(7)(7(a;)) if 7 € Gal{K/k),x € X{K). 

In other words, if we identify X V {K) with X(K), an element 7 of Gal(ivT/A;) 
acts on Xp(K) by the twisted action : 

x ^ ^(7) (7(2)) 

In particular, the k-rational points of X^ correspond (via 6~ l ) to the points 
x € X(K) such that 7(2:) = v?(7 _1 )x for every 7 E Gal(i^//c). 

In what follows we apply the 92-twist to X = split form of G, with ^(7) 
being a k- automorphism of G for every 7 € Gal(i^/fc). In that case, G<p is 
a fc-form of G\ this form is inner if all 93(7) belong to G ad (-KT) where G ad is 
the adjoint group of G. The effect of the twist is to make /c-rational some 
elements of G which were not. In order to define (p, we shall have to use the 
/^-automorphisms of G provided by the Tits group W*, see next section. 

§11. A general construction 

Here, G is a split simply connected group over k, and T is a maximal split 
torus of G. We put N = N G (T) and W = N/T is the Weyl group. 

11.1. The Tits group. The exact sequence 1— > T ^ N — > W — > I does 
not split in general. However Tits ([Ti 66aj . [TT 66b]) has shown how to 
construct a subgroup^ W* of N(k) having the following properties: 

(1) The map W* — > W is surjective. 

(2) The group W* D T is equal to the subgroup T2 of T made up of the 
points x of T with x 2 = 1. 

We thus have a commutative diagram, where the vertical maps are inclu- 
sions: 

1 — > T 2 — » W* — > — > 1 

4- 4" 4" 4" 
l^r^N w 1 

We refer to Tits (loc. cit.) and to Bourbak^ ([LIE X], pp. 115-116, exerc. 
12, 13) for the construction and the properties of W*. For instance: 

If G comes from a split group scheme G over Z, then W* is equal to iV(Z), 
the group of integral points of the group scheme A r . 



The construction of W depends on more than (G,T): one needs a pinning 
("epinglage") of (G, T) in the sense of |SGA 3j . XXIII. 1.1. 

4 Bourbaki works in the context of compact real Lie groups; his results can easily be 
translated to the algebraic setting we use here. 
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In the case of SL n , this means that one can choose for W* the group of 
monomial matrices with non-zero entries ±1 and determinant 1. For n = 2, 

W* is the cyclic group of order 4 generated by 

Note also that W* is a quotient of the braid group associated to W . 
(For the definition of the braid group of a Coxeter group, see e.g. I>.\1 97] .) 

11.2. Special elements of W. We now go back to our general notation 
l,m,t,... of Lecture II. Recall that the M-bound M = M(£, k, R) is given 

by ' 

(11.2.1) M = ^2(m + v £ (di)) , cf. §6.2. 

t\di 

Let a(t) be the number of indices i such that di = (mod t). We may rewrite 
(fTTXTT) as 

(11.2.2) M = ma(t) + ^ v £ (di). 

t\di 

Note that, if no di is divisible by t, we have M = and the trivial group 
A = 1 is optimal. Hence we shall assume in what follows that a{t) > 1. 

Let now w be an element of W. We shall say that w is special (with 
respect to t and €) if it has the following four properties: 

(1) w has order t in W. 

(2) w is the image of an element w* of W* such that (w*)* G T 2 n C(G), 
where C(G) is the center of G. 

(3) The characteristic polynomial of w (in the natural r-dimensional rep- 
resentation of W) is divisible by ($t) a ^, where <]?£ is the i-th cyclotomic 
polynomial. 

(Equivalently: if 2^ denotes a primitive t-th root of unity, then z% is an 
eigenvalue of w of multiplicity at least a(t).) 

(4) Let Cw(w) be the centralizer of w in IF. Then: 

(C w (w)) >J2 v i(di)- 

t\di 

Remark. The reader may wonder whether special elements exist for a given 
pair (t,£) (with a(t) > and 1 = 1 (modi), of course). The answer is 
"no" in general: if R is of type C3 and t = 4, no element of W* has both 
properties (1) and (2). Fortunately, the answer is "yes" for the exceptional 
types G 2 ,...,-E , 8 , cf. §12. 

Example : the regular case. Suppose that w £ W is regular of order t in the 
sense of Springei|j ( |Sp 74| , bottom of p. 170 - see also |BM 97] . §3). This 
means that w has an eigenvector v, with eigenvalue Zt, such that v does not 
belong to any reflecting hyperplane. Then w is special (for any t with 1 = 1 
(mod t)). Indeed: 

^With a slight difference: Springer requires t > 1 and we don't; it is convenient to view 
w = 1 as a regular element of W. 

Note that, if t is given, there is a very simple criterion ensuring the existence of a regular 
element of W of order t: the number of indices i such that di = (mod t) should be equal 
to the number of i's such that di = 2 (mod t), cf. Lehrer-Springer |LS 99] . cor. 5. 5. 
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(1) is obvious. 

(2) follows from the fact, proved in [B M 97] . §3, that w has a lifting w 
in the braid group B^y with w* = 7r, where tt has an image tt in W which 
belongs to T 2 D C(G). In Bourbaki's notation ([LIE X], p. 116) tt is the 
canonical element zq of the center of G. 

(3) is proved in Sp 74 , th. 4.2. 

, th. 4.2, in the stronger form |Cjy(W)| = Y\ t \ di d%. 



(4) is proved in 



Sp 74 



Special cases 

t = 1. Here w = 1 and w* = tt (one could also take w* = 1). 

t =2. Here w = wq = longest element of W . When —1 belongs to W, 
one has u>o = — 1 and Wq is central in W* (because wo is central in By/, cf. 
[BM 97| . 1.2 and 3.4). In that case the inner automorphism of G defined by 
Wq is a "Weyl-Chevalley involution": it acts on T by t H- 1 . 

11.3. An auxiliary result. 

Lemma 7. Suppose w £ W is special of order t. Then it is possible to 
choose a lifting w* of w in W* which satisfies 

(2*) (w*Y G T 2 n C(G) 
and 

(4*)«/(C W r.K))>E t | (i( « / (d i ). 

Proof. Let P be an £-Sylow of CV(w); the groups P and (if) commute, and 
P n (w) = 1 since to has order i and £ is prime to t (since £ = 1 modi). 
Hence the group generated by iu and P is the direct product P x (w). 
Since £ ^ 2, its 2-Sylow subgroup is contained in (w). Put C2 = T 2 H C(G). 
We have an exact sequence: 

1 -> T 2 /C 2 -> W*/C 2 ->■ W ->■ 1. 

By property (2) of w, this exact sequence splits over (w), hence over the 
2-Sylow of P w ; since the order of T 2 /C 2 is a power of 2, this implies that 
it splits over P w . We thus get an element w' of W* /C2, of order t, which 
lifts w, and centralizes a subgroup P' of !v"*/C 2 isomorphic to P. We then 
choose for w* a representative of u>' in W*; it has property (2*), moreover its 
centralizer contains the inverse image of P', which is canonically isomorphic 
to C 2 x P' . By property (4) we have 

v l {P')=v i {P)>Y J Vl{di)- 
t\di 

This shows that w* has property (4*). □ 

Remark. In the case where w is regular, one can do without lemma [71 
Indeed the braid group construction of [B M 97j gives a lifting w* of w having 
property (2*) and such that the map Cw*(w*) — > CV(fw) is surjective. 

11.4. The main result. 

Proposition 5. Suppose W contains an element w which is special with 
respect to t and £. Then there exist an inner twist G v of G (cf. §10) and 
a finite £-subgroup A of G v (k) such that the pair (G V ,A) is optimal in the 
sense of §7. 
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(In particular, th.9 is true for (k,£,R).) 

Proof. As in §9, we put K = fc(^), where is a root of unity of order I. 
Let Ct = Gal(K/k); it is a cyclic group of order t. 

Choose w* £ W* with the properties of lemma [7] and let cr be the inner 
automorphism of G defined by w* . Since a has order t, there exists an 
injective homomorphism: 

if : C t -> G &d (k) C Aut fc (G) 

which maps Ct onto the subgroup (a) of Autfc(C) generated by a. As ex- 
plained in §10, we may then define the ip -twist G v of G, relatively to the 
Galois extension K/k. The group G^ is an inner form of G; it has the same 
root system R. It remains to construct a finite ^-subgroup A of G v {k) such 
that (Gtp,A) is optimal, i.e. vi(A) = ma(t) + Yltidi v t(di)i cf- (11-2.2). 

We take for A the semi-direct product E m ■ P, with E m C T v (k) and 
P C Nu,(k), where E m and P are defined as follows: 

(1) P is an £-Sylow of Cw* (w*). By lemma[7|we have v^{P) > Ylt\di v e{^i)- 
Note that the points of P are fixed by a. Hence these points are rational 
over k not only in the group G but also in the group G v . 

(2) E m is the subgroup of T v {k) made up of the elements x such that 
x = 1. 

It is clear that P normalizes E m , and that P n £/ m = 1. 

Lemma 8. T/ie group E m contains a product of a{t) copies of the group 
Z/£ m Z. 

This implies that vi(E m ) > ma(t) and hence 

v e (A) = v e (E m ) + v e (P) > ma(t) + v e (di). 

t\di 

We thus get vg(A) > M and since M is an upper bound for vi(A) we have 
Vi(A) = M. □ 

Proof of lemma 8. Consider first the subgroup T^m of T(k s ) made up of 
the elements x with x 1 ™ = 1. Since T is /c-split, and K = k(zi) = k{z(m) 
(cf. §4 and §9), the points of T^m are rational over K. If we write T(m{K) 
additively, it becomes a free Z/£ m Z-module of rank r and the action of a 
generator s of Ct is by x i— > sx, where s is identified with an element of order 
t in Z| (i.e. s ="^t" with our usual notation for roots of unity). As for the 
action of w* (i.e. of w) on Tgm(K), it can be put in diagonal form since w is 
of order t and t divides I — 1; its diagonal elements are r elements yi, . . . , y r 
of Z/£ m Z, with y\ = \. Let c be the largest integer such that (&t) c divides 
the characteristic polynomial of w. By property (3) of 11.2, we have c > a(t) 
(in fact, c = a(t), by |Sp 74| , th. 3.4). This implies that the family of the 
UiS contains c times each primitive i-th root of unity (viewed as element of 
(Z/e m Z)*). In particular, there is a Z/^Z-submodule X of T t m{K) which 
is free of rank c and on which w acts by x i— > 2^ 1 a;. If we twist C7, T, T^m by 
the new action of Ct = Gdl(K/k) on X is trivial (cf. end of §10). This 
means that X is contained in T^^k), hence in E m , which proves the lemma. 

□ 
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Note the following consequence of proposition 4: 

Corollary. If W contains a t-regular element in the sense of |Sp 74| ; then 
theorem 9 is true for k, £, R. 

In the case t = 1, no twist is necessary (one takes w = 1, w* = 1, cf. 
§11.2). 



§12. Proof of theorem 9 for exceptional groups 

In each case we will show that the Weyl group contains an element w 
which is special with respect to t and £, so that we may apply prop. 5. 

12.1. The case of G2. The degrees di are d\ = 2,^2 = 6. Since t divides 
one of them, t is a divisor of 6, hence is regular ( |Sp 74| , no. 5.4). We may 
then apply prop. 5. □ 

Explicit description of w, w*: if c is a Coxeter element of W, c is of order 
6, and every lifting c* of c in W* has order 6. Hence, for any divisor t of 6, 
we may take w = c 6//t and w* = (c*) 6// *. 

12.2. The case of F4. The d^s are: 2, 6, 8, 12. All their divisors are regular 
(Springer, loc. cit.). One concludes as for G2. □ 

12.3. The case of E 6 . The dj's are: 2, 5, 6, 8, 9, 12. All their divisors are 
regular, except t = 5. In that case, choose any element w 6 W of order 5. 
Since the kernel of W* — > W is a 2-group, w can be lifted to an element w* 
of W* of order 5. Conditions (1) and (2) of §11.2 are obviously satisfied. 
The same is true for condition (3), since o(5) = 1 (only one of the dj's is 
divisible by 5), and w has at least one eigenvalue of order 5. As for condition 
(4), it is trivial, since £ = 1 (mod 5) implies £ > 11, and £ does not divide 
any of the d^s, so that Ylt\di v i(di) is 0. Hence w is special with respect to 
(5,£). □ 

12.4. The case of E 7 . The dja are: 2, 6, 8, 10, 12, 14, 18. By |Sp 74| , 
loc. cit. all their divisors are regular except 4, 5, 8, 10, 12. If t = 4, 5, 8 or 
12, t already occurs for E$, with the same values of a(t), namely 2, 1, 1 and 
1. Hence, we have ^-special elements ^4,105,^8 and W12 in W(Eq). One 
then takes their images in W(-EV) by the injective map W(Eq) — > W(Ej), 
and one checks that they are ^-special (here again condition (4) is trivial 
since ve(di) = for all the ts with £ = 1 (modi)). 

As for t = 10, one takes w = —W5, which makes sense since — 1 £ W. 
The element —1 (usually denoted by Wq) can be lifted to a central element 
e of W* with e 2 € T2 nC(G); this is a general property of the case —l£W 
(which reflects the fact that —1 is 2-regular, see end of §11.2). Hence, if w§ 
is a lifting of W5 of order 5, £w§ is a lifting of w of order 10, and this shows 
that w is special with respect to 10 and I. □ 

12.5. The case of E 8 . The dj's are: 2, 8, 12, 14, 18, 20, 24, 30. By |Sp 74| , 
loc. cit., all their divisors are regular except 7, 9, 14, 18. 

If t = 7 (resp. 9), one chooses w-j G W of order 7 (resp. tug G of order 
9). Since 7 and 9 are odd, condition (2) of §11.2 is satisfied. The same is true 
for condition (3) because a{t) = 1, and for condition (4) because vi(di) = 
for all i. 
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If t = 14 (resp. 18), one takes w = —wj (resp. w = — wg), as we did for 
E 7 . □ 

§13. Proof of theorems 10 and 11 

Here £ = 2. There are three cases (cf. §4.2): 

(a) Imx 2 oc = 1 + 2 m 7i 2 with m > 2. In that case the M-bound is 
rm + V2(W), and th.10 asserts that an optimal pair (G,A) exists for ev- 
ery type R. 

(b) Imx 2 oc = (-l + 2 m ), with m > 2. The M-bound is r m + n + v 2 (W), 
where ro (resp. r{) is the number of i's such that d, is odd (resp. even). 
Here, too, th.10 asserts that an optimal pair exists. 

(c) Imx 200 = (-1, 1 + 2 m ), with m > 2. 

The M-bound is the same as in case (b), but th.ll does not claim that 
it can be met (i.e. that an optimal pair exists); it merely says that there is 
a pair (G, A) with v 2 (A) = r§m + v 2 (W); such a pair is optimal only when 
r\ = 0, i.e. when —1 belongs to the Weyl group. 

13.1. Proof of theorem 10 in case (a). We take G split and simply 
connected, and we choose a maximal split torus T. We use the notation 
(N,W,W*) of §11. Let E be the 2-torsion subgroup of T(k). Since T is 
isomorphic to the product of r copies of G m , E is isomorphic to a product 
of m copies of Z/2 m Z, cf. §4.2. Hence v 2 (E) = rm. The group E is 
normalized by the Tits group W*; we define A as A = E-W*. The exact 
sequence 

1 ->■ E -> A^W 1 
shows that v 2 {A) = rm + v 2 (W). Hence (G, A) is optimal. 

13.2. Cases (b) and (c). As in §11.4, we start with a split G, with a split 
maximal torus T. We define N, W, W* as usual. After choosing an order on 
the root system R, we may view W as a Coxeter group; let u>q be its longest 
element. It has order 2, and it is regular in the sense of Springer |Sp 74| . As 
explained in §11.2, this implies that there is a lifting Wq of wq in W* which 
has the following two properties: 

(i) its square belongs to the center of G; 

(ii) the natural map Cw*(w;q) — > C\y(wo) is surjective. 

Let a be the inner automorphism of G defined by Wq. By (i), we have a 2 = 1. 
Let K = k(i) and let <p be the homomorphism of G&l(K/k) into Aut^(G) 
whose image is {l,cr}. Let us define G^ as the ip-twist of G, in the sense 
defined in §10. Denote by T v , and W* the (^-twists of T, N and W*. We 
have an exact sequence 

1 -> T v -> N v -> W v -»• 1, 

where Wm is the p-twist of W. Note that W ip (k) is equal to the centralizer 
C\y(wo) of wq in W, and similarly W*(k) is equal to C w , (wq). 

As in §13.1, let E be the 2-torsion subgroup of T^k). It is normalized by 
C w , (wq). Define A C G v (k) to be the group A = E ■ C w , (wg). By (ii), we 
have an exact sequence: 

1 -> S ->■ A -> C^(^o) -> 1 , 
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which shows that v 2 (A) = V2(E) + ^(^(wo))- The fact that wq is regular 
of order 2 implies that 

\Cw(w )\ = JJ<ij, 

2\di 

hence v 2 (Cw{wq)) = ^ v 2 (di) = v 2 (W) . This gives: 

(13.2.1) v 2 (A) = v 2 (E) + v 2 (W). 

Proposition 6. We have : 

v 2 (E) = n + rom in case (b) 
v 2 (E) = rom in case (c). 

In case (b), this shows that (G V ,A) is optimal, which proves th.10. Sim- 
ilarly, the fact that v 2 (A) = rom + v 2 (W) proves th.ll in case (c). 

13.3. Proof of proposition 6. We need to describe explicitly the torus 
T<p. To do so, let us first define the following two tori: 

G G m = 1-dimensional torus deduced from G m by Galois twist relatively 
to K/k. Its group of fc-points is K\ = KerA^^ : K* — >■ k*. 

R K i k G m = 2-dimensional torus deduced from G m by Weil's restriction 
of scalars relatively to K/k. Its group of fc-points is K*. 

Lemma 9. The torus T v is isomorphic to the product of r\ copies of 
R K i k G m and ro — r\ copies of G^. 

Proof. The character group X = Hom(T, G rn ) is free of rank r, with basis 
the fundamental weights u)i,...,u) r . This gives a decomposition of T as 

T = Ti x T 2 x • • • x T r , 

where each T« is canonically isomorphic to G m . Let r = — wq be the oppo- 
sition involution of the root system R; it permutes u±, . . . ,co r with r\ orbits 
of order 2, and ro — r\ orbits of order 1. (This follows from the fact that 
— 1 is an eigenvalue of wq of multiplicity ro.) The involution r permutes the 
tori Tj. If an index j is fixed by r, then ^o acts on Tj by t t^ 1 and the 
twisted torus (Tj)^ is isomorphic to G^; similarly, if r permutes j and f, 
the torus (Tj x Tj/)^ is isomorphic to R K / k G m . This proves lemma 9. □ 

End of the proof of prop. 6. The 2-torsion subgroup of G^(fc) = K\ is cyclic 
of order 2 m ; the 2-torsion subgroup of Rx/kG m (k) = K* is cyclic of order 
2 m+1 in case (b) and of order 2 m in case (c). We get what we wanted, 
namely: 

case (b): v 2 (E) = r\(m + 1) + (ro — r\)m = r$m + r\ 
case (c): v 2 (E) = r\m + (ro — r{)m = rom. 
This completes the proof of prop. 6, and hence of th.10 and th.ll. □ 

13.4. Remarks on the non simply connected case. The proof above 
could have been given without assuming that the split group G is simply 
connected. The main difference is in lemma 9: in the general case, the torus 
T v is a product of three factors (instead of two): 

T v = (G m ) a x (G^ x (R K/k G m y , 
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where a, /3, 7 are integers, with /3 + 7 = ro and a + 7 = ri. This gives the 
following formulae for V2(E) : 

case (b) : V2(E) = a + /3m + j(m + 1) = r± + r^m 

case (c) : V2(E) = a + /3m + 7m = a + rom . 
In case (b) one finds the same value for V2(A), namely the M-bound. 
In case (c) one finds a result which is intermediate between the M-bound 
r\ + rom + V2(W) and the value rom + V2(W) given by th.ll. 

Examples (assuming we are in case (c)). 

- Type A r , r even. One finds that a is always 0, so that one does not gain 
anything by choosing non simply connected groups. Indeed, in that case, it 
is possible to prove, by a variant of Schur's method, that the value of V2(A) 
given by th.ll is best possible. 

- Type A r , r odd > 3. Here r\ = (r — l)/2. One finds that a = if r = 1 
(mod 4), but that a can be equal to 1 if r = 3 (mod 4). When r = 3, we 
thus get a = r\\ this shows that the M-bound is best possible for type A3. 

- Type D r , r odd. Here r± = 1, and if one chooses G neither simply 
connected nor adjoint, one has a = 1. This means that the orthogonal 
group S02r has an inner fc-form which contains an optimal A. (Note the 
case r = 3, where D3 = A3.) 

- Type Eq. Here r\ = 2, and one has a = both for the simply connected 
group and for the adjoint group (indeed, a is for every adjoint group). 
I do not know whether the bound of th.ll is best possible in this case. 

§14. The case m = 00 

14.1. Statements. We keep the notation (G, R, W, di,£, t, m) of §4 and §6; 
as before, we assume that G is of inner type. 

We consider the case m = 00, i.e. the case where the image of Xi°° JS 
finite; that image is then cyclic of order t, cf. §4. 

Let a(t) be the number of i's such that di = (mod t). If a(t) = 0, then 
G(k) is ^-torsion free, cf. §6.2, cor. to prop. 4. In what follows, we shall thus 
assume that a(t) > 1. In that case, G(k) may contain infinite ^-subgroups 
(we say that a group is an i-group if every element of that group has order 
a power of £). The following two theorems show that a(t) controls the size 
of such a subgroup: 

Theorem 12. Let A be an l-subgroup of G(k). Then A contains a subgroup 
of finite index isomorphic to the i-group (Q^/Z^) a ; with a < a(t). 

(Note that Q^/Z^ is the union of an increasing sequence of cyclic groups 
of order £, £ 2 , . . . ; it is the analogue of Z/£ m Z for m = 00.) 

The bound a < a(t) of th.12. is optimal. More precisely: 

Theorem 13. There exist a semisimple group G of inner type, with root 
system R, and an £- subgroup A of G(k), such that A is isomorphic to the 
product of a(t) copies o/Q^/Z^. 

14.2. Proof of theorem 12. We need a few lemmas: 
Lemma 10. Any finitely generated £-subgroup of G(k) is finite. 
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Proof. Let B be a finitely generated ^-subgroup of G(k). We may embed B 
in GL n (k) for n large enough. By a known result (see §1.2) there exists a 
subgroup B' of B, of finite index, which is torsion-free if char(fc) = 0, and 
has only p-torsion if char(fc) = p. Since B' is an £-group, this means that 
B' = 1, hence -B is finite. □ 

Lemma 11. There exists a maximal k-torus of G which is normalized by 
A. (Recall that A is an ^-subgroup of G(k).) 

Proof. Let F be the set of all finite subgroups of A, ordered by inclusion. 
Lemma 10 implies that, if B\ and B2 belong to F, so does {B\, B2). Let X 
be the /c-variety parametrizing the maximal tori of G; it is a homogeneous 
space of G. If B G F, let X B be the subvariety of X fixed by B; a point 
of X B corresponds to a maximal torus of G normalized by B. By the 
noetherian property of the scheme X, one may choose Bq € F such that 
X is minimal among the X B 's. If B e F, then x( B °' B *> is contained 
in X B() , hence equal to X B °. This shows that X B ° is contained in all the 
X B 's, i.e. that every maximal torus which is normalized by Bq is normalized 
by all the B 7 s, hence by A. By the corollary to th.3" of §3.3 (applied to the 
finite £-group Bq) there exists such a torus which is defined over k. □ 

Lemma 12. Let u G M r (Z^) be anrxr matrix with coefficients in Z^, which 
we view as an endomorphism o/(Q^/Z^) r . Then Ker(u) has a subgroup of 
finite index isomorphic to the product of r — rank(it) copies of Qe/^e- 

In other words, the "corank" of Ker(ii) is equal to r — rank(n). 

Proof. Same as that of lemma 4 of §5.2: by reduction to the case where u 
is a diagonal matrix. □ 

Lemma 13. Let zt be a primitive t-th root of unity, and let w be an element 
ofW. The multiplicity of z t as an eigenvalue of w is < a(t). 

Proof. See |Sp 74| , th.3.4(i) where it is deduced from the fact that the 
polynomial det(t — w) divides IJi(* ~~ ■*-)• ^ 

Lemma 14. Let T be a maximal k-torus of G, and let T(k)i be the l-torsion 
subgroup ofT{k). We have corank T{k)z < a(t). 

As above, the "corank" of a commutative ^-group is the largest n such 
that the group contains the product of n copies of Q^/Z^. 

Proof. As in §5.2, let Y(T) = Hom/% s (G m ,T) be the group of cocharacters 
of T. The action of the Galois group T/% on Y(T) gives a homomorphism 

p : T fc -> Aut Y(T) ~ GL r (Z) 

and the image of p is contained in the Weyl group W (this is still another way 
of saying that G is of inner type). The group acts on T(k s )e ~ (Qe/Zi£) r 
by p ® Xi where x = X e °c ■ Let us now choose g G Tfc such that = z^ 1 , 
where Zt is an element of order t of Z|, and let w = p{g). The element g 
acts on T(k s )i by wz^ . Let T g be the kernel of g — 1 on T(k s )f . By lemma 
12, we have corank (T g ) = r— rank(<? — 1), which is equal to the multiplicity 
of Zt as an eigenvalue of w; using lemma 13, we get corank(T 9 ) < a(t), and 
since T(k)g is contained in T g , we have corank(T(/s)£) < a(t). □ 
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End of the proof of th.12. By lemma 11, there is a maximal /c-torus T of 
G which is normalized by A. Let A° = A n T(k). Then A° is an abelian 
subgroup of A of finite index. Since ^4° is contained in T{k)^, lemma 14 
shows that A° is isomorphic to the product of a finite group with a product 
of at most a(t) copies of Q^/Z^. □ 

14.3. Proof of theorem 13. We follow the same strategy as for theorem 
9, 10 and 11. There are three cases: 

14.3.1. Classical groups (I / 2). We change slightly the definitions of 
§9.1: we define An as the subgroup of G~Ln(K), with K = k(zi), made up 
of the diagonal matrices of order a power of £; it is isomorphic to (Q^/Z^)^. 

For any given n > 2, we put N = [n/t] and we get embeddings 

An -> GL N (K) -> GLjvt(fc) -> GL n (fc). 

If t > 1, one checks that the /c-determinant of every element of An is 1; 
we thus get an embedding An — > SL n (fc) which has the required properties 
since N = a(t) in that case. When t = 1, we replace An by the subgroup of 
its elements of fc-determinant 1, and we also get what we want. This solves 
the case of type A r . Types B r , C r and D r are then treated by the methods 
of §9.3 and §9.4. 

14.3.2. Exceptional groups (1^2). One replaces prop. 5 of §11.4 by a 
statement giving the existence of A C G^k) with A ~ (Q^/Z^)°^. The 
proof is the same. One then proceeds as in §12. 



14.3.3. The case 1 = 2. Same method as in §13. 



□ 
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Introduction 

The present text reproduces - with a number of additions - a series of three 
two-hour lectures given at the Ecole Polytechnique Federale de Lausanne 
(E.P.F.L.) on May 25-26-27, 2005. 

The starting point is a classical result of Minkowski, dating from 1887, 
which gives a multiplicative upper bound for the orders of the finite sub- 
groups of GL n (Q). The method can easily be extended to other algebraic 
groups than GL n , and the field Q can be replaced by any number field. 
What is less obvious is that: 

a) one can work over an arbitrary ground field; 

b) in most cases one may construct examples showing that the bound 
thus obtained is optimal. 

This is what I explain in the lectures. 

Lecture I is historical: Minkowski (§1), Schur (§2), Blichfeldt and others 
(§3). The results it describes are mostly well-known, so that I did not feel 
compelled to give complete proofs. 

Lecture II gives upper bounds for the order of a finite f-subgroup of G(k), 
where G is a reductive group over a field k, and I is a prime number. These 
bounds depend on G via its root system, and on k via the size of the Galois 
group of its £-cyclotomic tower (§4). One of these bounds (called here the 
S-bound, cf. §5) is a bit crude but is easy to prove and to apply. The 
second one (called the M-bound) is the most interesting one (§6). Its proof 
follows Minkowski's method, combined with Chebotarev's density theorem 
(for schemes of any dimension, not merely dimension 1); it has a curious 
cohomological generalization cf. §6.8. The last subsection (§6.9) mentions 
some related problems, not on semisimple groups, but on Cremona groups; 
for instance: does the field Q(X,Y,Z) have an automorphism of order 11 ? 

Lecture III gives the construction of "optimal" large subgroups. The case 
of the classical groups (§9) is not difficult. Exceptional groups such as E% 
are a different matter; to handle them, we shall use Galois twists, braid 
groups and Tits groups, cf. §§10-12. 
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I. History: Minkowski, Schur, ... 

§1. Minkowski 

Reference: [Mi 87]. 

1.1. Statements. We shall use the following notation: 

£ is a fixed prime number; when we need other primes we usually denote 
them by p; 

the ^-adic valuation of a rational number x is denoted by vg(x); one has 
vi(£) = 1, and v^{x) = if x is an integer with (x,£) = 1; 

the number of elements of a finite set A is denoted by \A\; we write vi{A) 
instead of t^(|A|); if A is a group, £ vl{A) is the order of an ^-Sylow of A; 
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if x is a real number, its integral part ("floor") is denoted by [x]. 
We may now state Minkowski's theorem ([Mi 87]): 

Theorem 1. Let n be an integer > 1, and let £ be a prime number. Define: 



M(n,£) 



n 



+ 



n 



l(£-\) 



+ 



n 



£ 2 (£-l)_ 



+ ... 



Then: 

(i) If A is a finite subgroup o/GL n (Q), we have vi(A) < M(n,£). 

(ii) There exists a finite £-subgroup A o/GL n (Q) with ve(A) = M(n,£). 

The proof will be given in §1.3 and §1.4. 

Remarks. 

1) Let us define an integer M(n) by: 



M(n) 



I 



Part (i) of th.l says that the order of any finite subgroup of GL n (Q) di- 
vides M(n), and part (ii) says that M(n) is the smallest integer having this 
property. Hence M(n) is a sharp multiplicative bound for \A\. 
Here are the values of M{n) for n < 8: 

M (1) = 2 

M(2) = 2 3 - 3 = 24 

M(3) = 2 4 - 3 = 48 

M(4) = 2 7 -3 2 -5 = 5760 

M(5) = 2 8 - 3 2 - 5 = 11520 

M(6) = 2 10 - 3 4 - 5 • 7 = 2903040 

M (7) = 2 11 • 3 4 • 5 • 7 = 5806080 

M(8) = 2 15 - 3 5 - 5 2 - 7 = 1393459200. 

Note that 



M(n)/M(n - 1) = < 



'2 if n is odd 

denominator of b n /n if n is even, 



where b n is the n-th Bernoulli number. (The occurence of the Bernoulli 
numbers is natural in view of the mass formulae which Minkowski had proved 
a few years before.) 

2) One may ask whether there is a finite subgroup A of GL n (Q) of order 
M(n). It is so for n = 1 and n = 3 and probably for no other value of n (as 
Burnside already remarked on p. 484 of [Bu 11]). Indeed, some incomplete 
arguments of Weisfeiler and Feit would imply that the upper bound of \A\ 
is 2 n • n\ if n > 10, which is much smaller than M(n). See the comments of 
Guralnick-Lorenz in [GL 06], §6.1. 



Exercise. Let 



— a i£ l i < Oj < ^ — 1, be the ^-adic expansion of 



e-i 



Show that M{n,l) = Y,ai^=i k = E M{a i l i {£ - 1),£). 
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1.2. Minkowski's lemma. Minkowski's paper starts with the following 
often quoted lemma: 

Lemma 1. If m > 3, the kernel o/GL n (Z) — > GL n (Z/mZ) is torsion free. 

Proof. Easy exercise ! One may deduce it from general results on formal 
groups over local rings, cf. Bourbaki [LIE III], §7. Many variants exist. For 
instance: 

Lemma 1'. Let R be a local ring with maximal ideal m and residue field 
k = R/m. If i is a prime number distinct from char(£i), the kernel of the 
map GL„(i?) —■ GL„(/e) does not contain any element of order I. 

Proof. Suppose x € GL n (i?) has order i and gives 1 in GL„(A;). Write 
x = 1 + y; all the coefficients of the matrix y belong to m. Since x l = 1, we 
have 

which we may write as y ■ u = 0, with u = i + ( 2 )y + • • • + y^~ l - The image 
of u in GL n (fc) is I, which is invertible. Hence u is invertible, and since y ■ u 
is 0, this shows that y = 0. □ 

Several other variants can be found in [SZ 96]. 

Remark. A nice consequence of lemma 1' is the following result of Malcev 
and Selberg ([Bo 69], §17): 

(*) Let r be a finitely generated subgroup of GL n (K), where K is a field 
of characteristic 0. Then T has a torsion free subgroup of finite index. 

Sketch of proof (for more details, see Borel, loc.cit.). Let S be a finite 
generating subset of T, and let L be the ring generated by the coefficients 
of the elements of S U S -1 . We have T C GL„(L). Let m be a maximal 
ideal of L; the residue field k = A/m is finite ([AC V], p. 68, cor.l to th.3); 
let p be its characteristic. The kernel ri of T — » GL n (fc) has finite index in 
T; by lemma V (applied to the local ring R = L m ), T\ does not have any 
torsion except possibly p-torsion. By choosing another maximal ideal of L, 
with a different residue characteristic, one gets a torsion free subgroup of 
finite index of Ti, and hence of F. □ 

Remark. When K has characteristic p > the same proof shows that T 
has a subgroup of finite index which is "p'-torsion free", i.e. such that its 
elements of finite order have order a power of p. 

1.3. Proof of theorem 1 (i). Let A be a finite subgroup of GL n (Q); we 

have to show that vg{A) < M(n,£). Note first: 

1.3.1. The group A is conjugate to a subgroup o/GL n (Z). 

This amounts to saying that there exists an yl-stable lattice in Q n , which 
is clear: just take the lattice generated by the A-transforms of the standard 
lattice Z n . 



LECTURE I: HISTORY: MINKOWSKI, SCHUR, 



5 



1.3.2. There is a positive definite quadratic form on Q n , with integral coef- 
ficients, which is invariant by A. 

Same argument: take the sum of the ^4-transforms of x\ + • • • + x^, and 
multiply it by a suitable non-zero integer, in order to cancel any denomina- 
tor. 

Let us now proceed with the proof of V(,(A) < M(n,£). We do it in two 
steps: 

1.3.3. The case £ > 2. 

By 1.3.1, we may assume that A is contained in GL n (Z). Let p be a 
prime number 7^ 2. By lemma 1, the map A — > GL„(Z/pZ) is injective. 
Hence 

v t (A) < a(p) = v e (GL n (Z/ P Z)). 

The order of GL n (Z/pZ) is p n ( n - 1 )/ 2 (p - \){p 2 - 1) . . . (p n - 1). Let us 
assume that p 7^ I. Then we have 



»(p) = ^v l {p i - 1). 



We now choose p in such a way that a{p) is as small as possible. More 
precisely, we choose p such that: 

(*) The image of p in (Z/£ 2 Z)* is a generator of that group. 

This is possible by Dirichlet's theorem on the existence of primes in 
arithmetic progressions (of course, one should also observe that (Z/^ 2 Z)* 
is cyclic.) 

Once p is chosen in that way, then p % — 1 is divisible by i only if i is 
divisible by £ — 1; moreover, one has vg(p — 1) = 1 because of (*), and 
this implies that vi(p l — 1) = l + vi(i) if i is divisible by £ — 1. (This is where 
the hypothesis £ > 2 is used.) One can then compute a(p) by the formula 

above. The number of indices i < n which are divisible by i — 1 is 

We thus get: 



a(p) 



n 



£-1 



n 



l-l 



n 



n 



+ 



At- 1 ). 



+ 



1- 1 

M(n,£). 



n 



£-1 







This proves th.l (i) in the case £ / 2. 



1.3.4. The case £ = 2. 

When £ = 2, the method above does not give the right bound as soon 
as n > 1. One needs to replace GL ra by an orthogonal group. Indeed, by 
1.3.1 and 1.3.2, we may assume, not only that A is contained in GL n (Z), 
but also that it is contained in the orthogonal group O n (q), where q is 
a non-degenerate quadratic form with integral coefficients. Let D be the 
discriminant of q, and let us choose a prime number p > 2 which does not 
divide D. The image of A in GL n (Z/pZ) is contained in the orthogonal 
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group O n (Z/pZ) relative to the reduction of q mod p. If we put r = [n/2], 
the order of O n (Z/pZ) is known to be: 



2 -p r \p A - l)(p 4 - 1) . . . (p 2r - 1) if n is odd. 



and 



2 • p^-^Cp 2 - l)(p 4 - 1) . . . (p 2r - l)/(p r + e) if n is even, 



with e = ±1 equal to the Legendre symbol at p of (— l) r D. 

If we choose p = ±3 (mod 8), we have V2(p 2t — 1) = 3 + V2(i), and 
U2(p r + e) > 1. If n is odd, this gives 







r" 




+ 




.2. 


.4. 



v 2 (O n (Z/pZ)) = 1 + 3r + u 2 (r!) = n + r + 
and, if n is even: 

«2(O n (Z/pZ)) < 3r + «2(r!) = M(n, 2) 
Hence i>2(^4) is at most equal to M(n, 2). 



+ 



M(n,2), 



□ 



Remark. There are several ways of writing down this proof. For instance: 

- There is no need to embed A in GL n (Z). It sits in GL„(Z[l/iV]) for a 
suitable N > 1, and this allows us to reduce mod p for all p's not dividing 
N. 

- Minkowski's lemma is not needed either: we could replace it by the 
trivial fact that a matrix which is different from 1 is not congruent to 1 
mo dp for all large enough p's. 

- Even when £ > 2, we could have worked in O n instead of GL„; that is 
what Minkowski does. 

- When £ = 2 the case n even can be reduced to the case n odd by 
observing that, if A C GL n (Q), then A x {±1} embeds into GL n+ i(Q), 
and M(n + 1, 2) is equal to 1 + M(n, 2). 



1.4. Proof of theorem 1 (ii). The symmetric group Sg has a faithful 
representation Sg — > GL(Vi) where V\ is a Q-vector space of dimension 

I -I. Put r = 



, and let V = V\ © • • • © V r be the direct sum of r copies 

of V\. Let S* be the semi-direct product of S r with the product (Si) 1 " of r 
copies of Se ("wreath product"). The group S has a natural, and faithful, 
action on V. We may thus view S as a subgroup of GL r ^_ 1 ^(Q), hence also 
of GL„(Q), since n > r(£ — 1). We have 



v e (S) = r + ve(r\) 



n 



+ 



n 



l(t-\) 



+ 



M(n,£). 



An £-Sylow A of S* satisfies the conditions of th.l (ii). □ 

Example. When £ = 2 the group S defined above is the "hyper-octahedral 
group" , i.e. the group of automorphisms of an n-cube (= the Weyl group of 
a root system of type B n ); in ATLAS notation, it may be written as 2 n • S n . 
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1.5. A conjugacy theorem. The finite ^-subgroups of GL n (Q) have the 
following Sylow-like property: 

Theorem 1'. Let A and A' be two finite (-subgroups o/GL n (Q). Assume 
that A has the maximal order allowed by th.l. Then A' is conjugate to a 
subgroup of A. 

Corollary. If\A\ = \A'\ = £ M (™< £ ) , then A and A' are conjugate in GL n (Q). 

Proof of theorem 1'. See Bourbaki, [LIE III], §7, exerc.6 f) where only the 
case I > 2 is given, and Feit [Fe 97] who does the case i = 2. Let us sketch 
Bourbaki's method (which we shall use in §6.6 in a more general setting): 

We may assume that A and A' are contained in GL„(Z). Choose a 
prime p as in 1.3.3, and reduce mod p. The groups A and A' then become 
^-subgroups of G p = GL n (Z/pZ), and A is an ^-Sylow of G p . By Sylow's 
theorem applied to G p , one finds an injection i : A' —> A which is induced 
by an inner automorphism of G p . The two linear representations of A': 

A' -»• GL n (Q) and A' 4 A -»• GL n (Q) 

become isomorphic after reduction mod p. Since p ^ £, & standard argument 
shows that they are isomorphic over Q, which proves th.l' in that case. The 
case I = 2 can be handled by a similar, but more complicated, argument: if 
n is odd, one uses orthogonal groups as in 1.3.4, and one reduces the case n 
even to the case n odd by the trick mentioned at the end of §1.3. □ 

Exercise. Let A(n) be a maximal 2-subgroup of GL„(Q). Show that the A(n)'s 
can be characterized by the following three properties: 

A(l) = {±1}. 

A(2n) = (A(n) x A(n)) ■ {±1} (wreath product) if n is a power of 2. 
A(n) = A{2 mi ) x ■■■ x A(2 m ")tfn = 2 mi -\ h 2™" with m x < ■ ■ ■ < m k . 



§2. Schur 

Ten years after [Mi 87], Frobenius founded the theory of characters of 
finite groups. It was then (and still is now) very tempting to use that theory 
to give a different proof of Minkowski's results. The first people to do so 
were Schur ([Sch 05]) and Burnside ([Bu 11], Note G). Schur 's paper is 
especially interesting. He works first over Q, as Minkowski did, and uses a 
very original argument in character theory, see §2.1 below. He then attacks 
the case of an arbitrary number field, where he gets a complete answer, see 
§2.2. 

2.1. Finite linear groups with rational trace. What Schur proves in §1 
of [Sch 05] is: 

Theorem 2. Let A be a finite l-subgroup o/GL n (C). Assume that the traces 
of the elements of A lie in Q. Then v^(A) < M(n,£), where M(n,£) is as 
in th.l. 
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The condition on the traces is obviously satisfied if A is contained in 
GL n (Q). Hence th.2 is a generalization of th.l. (As a matter of fact, it is a 
genuine generalization only when I = 2; indeed, when I > 2, it is known, cf. 
[Ro 58], that a finite ^-subgroup of GL n (C) with rational trace is conjugate 
to a subgroup of GL„(Q).) 

Proof. We start from the following general fact, which is implicit in [Sch 05] 
(and is sometimes called "Blichfeldt's lemma"): 

Proposition 1. Let G be a finite subgroup o/GL n (C) and let X be the sub- 
set of C made up of the elements Tr(g) for g G G, g ^ 1. Let N = Y[( n ~ x ) 
be the product of the n — x, for x £ X. Then N is a non-zero integer which 
is divisible by \G\. 

(Hence the knowledge of the set X gives a multiplicative bound for the 
order of G.) 

Proof. Let m = \G\, and let z be a primitive m-th root of unity. The 
elements of X are sums of powers of z; hence they belong to the ring of 
integers of the cyclotomic field K = Q(z). This already shows that N is an 
algebraic integer. If s is an element of Gal(A/Q), one has s(z) = z a for some 
a £ (Z/mZ)*. If x = Tr(<?), with g £ G, then s(x) = Tr(g a ), hence s(x) 
belongs to X. This shows that X is stable under the action of Gal(A/Q); 
hence N is fixed by Gal(A/Q); this proves that N belongs to Z. 

The factors of N are / 0. Indeed, Tr(g) is equal to the sum of n complex 
numbers zi with \zi\ = 1, hence can be equal to n only if all the Zi are equal 
to 1, which is impossible since g ^ 1. This shows that N / (one could 
also prove that A is positive, but we shall not need it). 

It remains to see that N is divisible by |G|. It is well-known that, if % is 
a generalized character of G, the sum J2 g( z G x{g) is divisible by |G|. Let us 
apply this to the function g h-> x{q) = FLeX ( Tr(g) — x) , which is a Z-linear 
combination of the characters g i— > Tr(g) m ,m > 0. Since x(<?) = f° r 9^1 
and x(l) = A, the sum of the x(<?) is equal to A. Hence A is divisible by 
\G\. ' □ 

The next lemma gives an information on the Tr(<7)'s: 

Lemma 2. Let A be as in th.2. If g € A, then Tr(g) may be written as 
n — ly with y € Z and < y < n/{i — 1). 

Proof. Each eigenvalue of g is of order t a for some a > 0, and all the 
eigenvalues with the same a have the same multiplicity. By splitting C n 
according to the a's, one is reduced to the following three cases: 

(1) g = 1 and n = 1. Here Tr(#) = 1 and we take y = 0. 

(2) g has order I and n = t — 1. Here Tr(g) = —1, and y = 1. 

(3) g has order £ a with a > 1 and n = t a ~ x {l - 1). Here Tr(#) = and 
y = £ a ~ 2 (£ - 1). 

In each case we have < y < n/(£ — 1). □ 

End of the proof of theorem 2. We apply prop.l to G = A. By lemma 2, 
each factor n — x of N can be written as ly with 1 < y < d = [n/£ — 1)]. 



LECTURE I: HISTORY: MINKOWSKI, SCHUR, 



9 



This shows that N divides the product £ d ■ d\ and we have 

v e {N) <d + v e (d\) = [n/{£ - 1)] + [n/£{£ - 1)] + • • • = M(n, £). 
Since |G| divides N, this proves th.2. □ 

Remark. One may ask whether th.2 can be complemented by a conjugacy 
theorem analogous to th.l' of §1.5. The answer is of course "yes" if £ > 2 
(because of th.l'), but it is "no" for i = 2: the dihedral group D4 and 
the quaternion group Qs are non-conjugate 2-subgroups of GL2(C), with 
rational trace, which have the maximal order allowed by th.2, namely 8. 

2.2. Replacing Q by an arbitrary number field. This is what Schur 
does in §§2-6 of [Sch 05]. Before stating his result, some notation is neces- 
sary: 

- k is a number field, viewed as a subfield of C. 

- For each a > 1, z a denotes a primitive o-th root of unity. 

- (assuming I > 2). We put t = [k(ze) : k] and we denote by m the 
maximal a such that k(zi) contains z ta (this notation coincides with Schur's, 
and it will be extended to arbitrary fields in §4 of Lect.II). We put 



M k (n,£) = m 



~n~ 




' n ' 




" n ' 


1. 


+ 


It. 


+ 


J 2 t\ 



+ 



- (assuming £ = 2). We put t = [k(i) : k] and we define m as explained in 
§4.2 (warning: t and m do not always coincide with Schur's ti and 7712). We 
put: 

M k (n,2) =n+(m'-l) [-J + [-J + [-J +..., 

where m! is equal to m + 1 in case (b) of §4.2 and is equal to m in the other 
cases. 

The main result of [Sch 05] is: 

Theorem 2'. Let A be a finite £- subgroup o/GL n (C) such thatTr(g) belongs 
to k for every g € A. Then ve(A) < Mk(n,£). 

Note that, when k = Q, the integer M^(n, £) is equal to Minkowski's M(n, £); 
hence th.2' is a generalization of th.2. 

Proof. I shall not give all the details of Schur's proof, but just explain its 
main steps. For more information, see [Sch 05] (and also [GL 06] for the 

case £ > 2). 

One of the inputs of the proof is the following result, which had just been 
proved by Blichfeldt ([Bl 04] - see also §3 below): 

2.2.1. Every linear representation of A is monomial. 

Hence one can decompose the vector space C™ as a direct sum of n lines 
D n which are permuted by A. This gives a homomorphism A — V S n \ 
its kernel A' is a normal abelian subgroup of A. Hence: 

2.2.2. The group A has a normal abelian subgroup A' such that [A : A') 
divides n\. 

This led Schur to investigate the case where A is abelian. He proved: 
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2.2.3. If A is as in th.2' , and is abelian, then : 

' m- [f] ifi > 2 

ve(A) < < 

(m'-i) • [f] +n if£ = 2. 

Sketch of proof. Since A is abelian, and the traces of its elements belong 
to k, it is conjugate to a subgroup of GL n (fc). Let R be the fc-subalgebra 
of M. n (k) generated by A. We may write R as a product Y[ Ki, where the 
Ki are cyclotomic extensions of k, of the form k(z (a .), with aj > 0. Let 
ni = [Ki : k\\ then ^rij < n. The image of A in if* is a cyclic group of 
order € % . If £ > 2, it is not difficult to see that a% < m • [y-] for every i. 

Adding up, we find Yl a i — m ' [f]> an d since ve(A) < J2 a i, we § e ^ t ne 
inequality (2.2.3). The case t = 2 is similar. □ 

Once this is done, the case I = 2 follows. Indeed (2.2.2) and (2.2.3) give 
^2(^4) < ^2(^4') + ^2(^0 <n + {m' — t) • [j] + ^2(^0' an d this is equivalent 
to V2{A) < Mk(n, 2). The case £ > 2 requires more work, cf. [Sch 05], §5.D 

Remarks 

1) The bound vi(A) < M^{n,tj is optimal; this is proved by the same 
explicit constructions as in §1.4, cf. [Sch 05], §6. 

2) As we already pointed out in §2.1, the hypothesis Ti{A) C k implies, 
when I > 2, that A is conjugate to a subgroup of GL n (/c). One may then 
use Minkowski's method, as will be explained in §6 for semisimple algebraic 
groups (of course GL n is not semisimple, but the method applies with almost 
no change - the invariant degrees di of §6 have to be replaced by 1, 2, . . . , n). 
The bound found in that way coincides with Schur's. 

For i = 2, if one does not assume that A can be embedded in GL n (fc), 
I do not see how to apply either Minkowski's method or the cohomological 
method of §6.8. This raises interesting questions. For instance, consider a 
finite subgroup A of E$(C), and suppose that the conjugacy classes of the 
elements of A are Q-rational. Is it true that V2(A) < 30, ^(^4) < 13, . . . , as 
would be the case if A were contained in the rational points of a Q-form of 
Ek, cf. §6.3.2 ? 

§3. Blichfeldt and others 

Blichfeldt's theorem (§3.1 below) has already been used in §2.2. The 
results of §3.3 will be applied in §5.4, in order to prove what I call the 
"S-bound". 

3.1. Blichfeldt's theorem. Recall that a finite group A is called super- 
solvable if it has a composition series 

1 = A) C Ai C • • • C A m = A 

where the Ai are normal in A (and not merely in Ai + ±) and the quotients 
Ai/Ai-i are cyclic. One has 

nilpotent =3- supersolvable =3- solvable. 

In particular, an £-group is supersolvable. 
One proves easily: 
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(*) If A is supersolvable and non abelian, there exists an abelian normal 
subgroup A' of A which is not contained in the center of A. 

Recall also that a linear representation V of a group A is called monomial 
if one can split V as a direct sum of lines which are permuted by A. When V 
is irreducible, this amounts to saying that V is induced by a 1-dimensional 
representation of a subgroup of A. 

We may now state Blichfeldt's theorem ([Bl 04], see also [Bu 11], §258): 

Theorem 3. Every complex linear representation of a supersolvable finite 
group is monomial. 

(As a matter of fact, Blichfeldt was only interested in the case where A is 
nilpotent.) 

Proof. The argument is now standard. We may assume that the given rep- 
resentation V is irrreducible and faithful. If A is abelian, we have dim V = 1 
and there is nothing to prove. If not, we choose A' as in (*) above, and we 
split V as V = (BV X , where x runs through the 1-dimensional characters of 
A', and V x is the corresponding eigenspace; let be a non-zero V x ; it is 
distinct from V (otherwise, A' would be central), and every non-zero V x is 
an A-transform of (because V is irreducible). Call B the subgroup of A 
stabilizing V^. We have A 1 C B C A, and \B\ < \A\. Using induction on 
\A\, we may assume that th.3 is true for B; this gives a splitting of as a 
direct sum of lines which are stable under B. By transforming them by A, 
we get the desired splitting of V. □ 

3.2. Borel-Serre. In [BS 53], Borel and I proved: 

Theorem 3'. Let G be a compact real Lie group, and let A be a finite 
supersolvable subgroup of G. There exists a maximal torus T of G which is 
normalized by A. 

Remark. When one applies th.3' to G = U n (C), one recovers th.3. Hence 
th.3' may be viewed as a generalization of Blichfeldt's theorem. 

Proof of theorem 3' (sketch). 

Lemma 3. Let q be a finite dimensional Lie algebra over a field of charac- 
teristic 0, and let s be an automorphism of prime order of g . If s has no 
fixed point ^ 0, then g is nilpotent. 

(Note the analogy with a - much deeper - theorem of Thompson 
[Th 60-64] : if a finite group G has an automorphism of prime order with no 
non-trivial fixed point, then G is nilpotent.) 

Proof of lemma 3. By extending scalars, we may assume that the ground 
field is algebraically closed. Let p be the order of s, and let z be a primitive 
p-th root of unity. Let Qi be the kernel of s — z % in g. We have 

= 0o ©0i © • • • ©0p-l , 

and the hypothesis made on s means that go = 0. One then shows that 
ad(x) p ~ 1 = for every x belonging to one of the g^'s. This implies that the 
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Killing form of g is 0, hence that g is solvable (Cartan's criterion). The fact 
that g is nilpotent follows easily. (For more details, see §4 of [BS 54].) □ 

Once this is done, th.3' is proved by an induction argument similar to the 
one used in the proof of Blichfeldt's theorem, cf. [BS 53], §3. □ 

3.3. Steinberg and Springer-Steinberg. We now come to the setting of 
linear algebraic groups. Let A; be a field, and let G be an algebraic group 
over k. We shall assume in what follows that G is linear and smooth over 
k; the connected component of the identity of G is denoted by G°. Recall 
that G is said to be reductive if it is connected and if its unipotent radical 
(over an algebraic closure of k) is trivial, cf. [Bo 91], §11.21. If k = C, 
such groups correspond (by a standard dictionary, cf. [Se 93], §5) to the 
connected compact Lie groups. [In the literature, a group G such that G° is 
reductive is sometimes called "reductive" ; this is reasonable in characteristic 
0, but not otherwise. Here we prefer that "reductive" implies "connected" .] 
Theorem 3' has the following analogue: 

Theorem 3". Let A be a finite supersolvable group of order prime to char(/c) 
and let G be a reductive group over k on which A acts by k- automorphisms. 
Then there exists a maximal torus T of G, defined over k, which is stable 
under the action of A. 

(When k = C, this is equivalent to th.3', thanks to the dictionary mentioned 
above.) 

Corollary. If A is a finite supersolvable subgroup of G(k), of order prime 
to char (A;), there is a maximal k-torus T of G whose normalizer N is such 
that A is contained in N(k). 

(Recall that, if X is a fc-variety, X(k) is the set of fc-points of X.) 

Proof of theorem 3". When k is algebraically closed, this is proved in [SS 
68], 1.5.16, with the help of several results from [St 68]. For an arbitrary 
field k, the same proof works with very little change. One starts with the 
following basic result of Steinberg ([St 68], th.7.2): 

Proposition 2. Assume k is algebraically closed. Let s : G — >■ G be a 
surjective homomorphism. Then there exists a Borel subgroup B of G such 
that s(B) = B. 

When s has finite order prime to char (A;), one can say much more: 

Proposition 3. Let s be an automorphism of G of finite order prime to 
char(fc), and let G s be the subgroup of G fixed by s. Then: 

a) The connected component of G s is reductive. 

b) One has dim G s > if G is not a torus. 

c) // k is algebraically closed, there exists a Borel subgroup B of G and a 
maximal torus T of B such that s(B) = B and s(T) = T. 

Proof (sketch). We may assume k is algebraically closed, since assertions a) 
and b) are "geometric". A proof of a) is given in [St 68], cor. 9. 4. A proof of 
c) is given in [SS 68], 1.2.9, as an application of prop. 2. Assertion b) follows 
from c) by the following method of Steinberg: one observes that a pair (B, T) 
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with B D T, determines canonically a homomorphism h : G m —> T (indeed 
B gives a basis of the root system of (G,T), and one takes for h twice the 
sum of the corresponding coroots). Moreover, h is non-trivial if G is not a 
torus. The canonicity of h implies that it is fixed by s. Hence G s contains 
lm(h). □ 

End of the proof of th.3" . By induction on \A\ + dimC When A = 1, one 
takes for T any maximal fe-torus of G; by a theorem of Grothendieck, there 
is such a torus (cf. [Bo 91], th.18.2). We may thus assume A ^ 1. In that 
case A contains a cyclic subgroup < s >, non-trivial, which is normal. We 
may also assume that G is semisimple and that A acts faithfully. Let G\ be 
the connected component of G s ; we have dimCi > 0, cf. prop. 3 b). The 
group A/ A' acts on G±. By the induction assumption, there is a maximal 
torus T\ of Gi, defined over k, which is stable under the action of A/A' , 
hence of A. Let G2 be the centralizer of T\ in G. It is a reductive group of 
the same rank as G. We have dim G2 < dim G, since T\ is not contained in 
the center of G. Moreover, G2 is stable under the action by A. By applying 
the induction assumption to the pair (G2 , A) we get a maximal A;-torus T of 
G2 which is ^4-stable. Since G2 and G have the same rank, T is a maximal 
torus of G. □ 



II. Upper bounds 

Let G be a reductive group over a field k, and let i be a prime number, 
different from char (A;). Let A be a finite subgroup of G(k). We want to give 
an upper bound for vg(A), in terms of invariants of G, k and I. We give 
two such bounds. The first one (§5) is less precise, but very easy to apply; 
we call it the S-bound (S for Schur). The other bound (§6) is the M-bound 
(M for Minkowski). Both bounds involve some cyclotomic invariants of k, 
which are defined in §4 below. 

§4. The invariants t and m 

4.0. Cyclotomic characters. Let k be an algebraic closure of k, and let 
k s be the separable closure of k in k. For each n > 1 prime to char(/c), let 
[i n C k* be the group of n-th roots of unity and let z n be a generator of \i n . 

The Galois group T/u — Gal(/c s //c) acts on (z n ) = fi n . This action defines 
a continuous homomorphism 

Xn :r fe ^Aut(/v) = (Z/nZ)*, 

which is called the n-th cyclotomic character of k. 

This applies in particular to n = i d (d = 0, 1, . . . ); by taking inverse limits 
we get the £°°-cyclotomic character 

Xtoo : T k -)■ Z| = Um(Z/^Z)*, 

where is the ring of ^-adic integers. What matters for us is the image 
Im x e oo, which is a closed subgroup of Z|. To discuss its structure, it is 
convenient to separate the cases £ 7^ 2 and 1 = 2. 
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4.1. The case £ ^ 2. We have 

Z £ * = Q_! x {l+£-Z,} 

where is cyclic of order £ — 1 (i.e. Q-i is the group of the 

^-adic field Q^; it is canonically isomorphic to F|). As for 1 + £-Z#, it is 
procyclic, generated by 1 + £, and isomorphic to the additive group Zf, its 
closed subgroups are the groups 1 + l d -Zi = (1 + £ d ), d = 1, 2, . . . , oo, with 
the convention l°° = 0. 

Since £ — 1 and £ are relatively prime, the subgroup Im x loc of Z\ decom- 
poses as a direct product: 

where t is a divisor of I — 1, Cj is cyclic of order £ and m = 1, 2, . . . or oo. 

Remark. An alternative definition of the invariants t and m is: 

t = [k(zi) : k] = fc-degree of zg 

m = upper bound of the d > 1 such that z d is contained in k(z£). 

Examples. If fc = Q or Q^, \ too is surjective and we have t = £—1, m = 1. If 
k = k s , then x £00 is trivial and i = 1, to = oo. If A; is finite with g elements, 
Im x e oo is the closed subgroup of Z\ generated by q and we have: 

t = order of q in F| 
m = - 1) = v e {q^ 1 - 1) . 

4.2. The case £ = 2. Here Z?j = C 2 x {1 + 4-Z 2 }, where C 2 = {1, -1} and 
the multiplicative group 1 + 4-Z 2 is isomorphic to the additive group Z 2 . 
There are three possibilities for Im x 200 : 

(a) Imx 2 oc = 1 + 2 m -Z 2 = (1 + 2 m ), with m = 2, . . . ,oo. We put t = 1. 

(b) Imx 2 oc = (-1 + 2 m ), with m = 2, . . . , oo. We put t = 2. 

(c) Imx 2 oc =C 2 x {l + 2 m -Z 2 } = (-l,l + 2 m ), m = 2,...,oo. We put 
t = 2. 

If m < oo, these types are distinct. If m = oo, types (b) and (c) coincide; 
in that case Im x 200 is equal to C 2 . 

Remark. We have t = [k(i) : k] with the usual notation i = Z4. Hence case 
(a) means that —1 is a square in k, and in that case m is the largest d > 2 
such that z , G k. 

2 d 

If t = 2, case (c) is characterized by the fact that —1 belongs to Imx 200 - 
As for m, it is given by: 

m = — 1 + upper bound of the d > 2 such that z^ d G in case (b) 

m = upper bound of the d > 2 such that z 2d G in case (c). 

Examples. If fc = Q or Q 2 , we have type (c) with t = 2,m = 2. If k = R, 
we have types (b) and (c) with m = 00. If /c is separably closed, we have 
type (a) with t = 1 and m = 00. 

When char(/c) 7^ 0, type (c) is impossible unless m = 00. If fc is finite 
with q elements, we have type (a) with m = v 2 (q — 1) if q = 1 (mod 4) and 
type (b) with m = v 2 (q + 1) if q = — 1 (mod 4). 
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4.3. The case of finitely generated fields. Let ko be the prime subfield 
of k, i.e. Q if char(/c) = or F p if char(fc) = p > 0. Suppose that k is finitely 
generated over ko. Then the invariant m is finite, i.e. Imx £OC , is infinite. 

Indeed, if not, there would be a finite extension k' of k containing the 
group jj, of all the £ d -th roots of unity (d = 1,2,...). Let K = fco(/i) be the 
extension of fco generated by [i. Then: 

(a) K is algebraic over ko 

(b) K is finitely generated over ko (because it is contained in k', cf. 
[A V], §14, cor. 3 to prop. 17). 

Hence K is either a finite field or a number field, which is absurd since 
such a field only contains finitely many roots of unity. 



§5. The S-bound 

We start with the case of tori: 

5.1. The S-bound for a torus: statements. 

Theorem 4. Let T be a torus over k, and let A be a finite subgroup ofT(k). 
Then 

TdimT 
vnA) < m — — — 

where m and t are defined as in §4 above and ip is Euler's totient function. 
The bound given by th.4 is optimal. More precisely: 

Theorem 4'. Assume m < oo. For every n > 1 there exist a k-torus T of 
dimension n and a finite subgroup A ofT(k) such that v^(A) = m - [n/<p(t)]. 

Example. Take k = Q and £ = 2, so that t = m = 2. Then th.4 says that 
any finite 2-subgroup of T(Q) has order < 4 dim T , and th.4' says that this 
bound can be attained. 

5.2. Proof of theorem 4. 

Lemma 4. Let u £ M n (Z£) be an n x n matrix with coefficients in 
which we view as an endomorphism of (Q^/Z^)". Then 

^(ker(w)) = det(u)) . 

Proof. This is clear if u is a diagonal matrix, and one reduces the general 
case to the diagonal one by multiplying u on the right and on the left by 
invertible matrices. □ 

Now let n be the dimension of the torus T. Let Y{T) = Hom^ (G m , T) be 
the group of cocharacters of T. The action of on Y(T) gives a homomor- 
phism p : T k -)• Aut (Y(T)) = GL n (Z). If we identify T with G m x • • • x G m 
(over k s ) by choosing a basis of Y(T), the ^°°-division points of T(k s ) form 
a group isomorphic to (Q^/Z^) n and the action of g G on that group is 
by p(g)x(g), where x = X^- 
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Lemma 5. Let A be a Unite subgroup of T(k). For every g E r& we have 
ve(A) < v e (det(p(g)x(g) -1)) =v £ {det(p(g- 1 )-x(.g))- 

Proof. By replacing A by its £-Sylow subgroup, we may assume that A is 
an ^-group, hence is contained in the ^-division points of T(k s ). Since the 
points of A are rational over k, they are fixed by g, i.e. they belong to 
the kernel of g — 1. The inequality then follows from lemma 4, applied to 

« = p(g) x(g) - !■ D 

We now choose g £ such that the inequality of lemma 5 gives that of 
th.4. Here is the choice: 

x(g) = z tu, where z t E has order t, and V((l— u) = m. 

(This works for £ = 2 as well as for £ ^ 2, thanks to the definition of t in 
§4.1 and §4.2. Note that in all cases but £ = 2, type (c), \{g) ls a topological 
generator of Imx-) 

We have p(g) £ GL n (Z), and p(g) is of finite order (because the image 
of p : Tfc — > GL„(Z) is finite). Hence the characteristic polynomial F of 
p{g~ l ) is a product of cyclotomic polynomials: 

(5.2.1) F = JJ with Vidj) = n. 

The inequality of lemma [5] gives 

Vi(A) <J2vi(<f> dj (z t u)). 
We thus need to compute Vi(<&d(ztu)) for every d > 1. The result is: 
Lemma 6. We have 



if d = t 

v e ($d(ztu)) = { 1 ifd = t-£ a ,a>l or a = -1 (if t = 2 = £) 
otherwise. 

Proof. (We restrict ourselves to the case £ ^ 2. The case £ = 2 is analogous 
but slightly different.) 

We have <&d( z t u ) = Y\( z t u ~ z ) w here z runs through the primitive d-th 
roots of unity in Q^. Write d as d = 5-£ a with (5, £) = 1 and a > 0. The 
images of the z's in the residue field of are primitive 5-th roots of 
unity. If 8 ^ t, none of them is equal to the image of ztu, which has order t. 
In that case, all the ZfU — z are units in hence have valuation and we 
have v^[^d{ z tu)) = 0. If 6 = t, i.e. d = t-£ a with a > 0, there are two cases: 

(a) a = 0, i.e. d = t. In that case, one of the z's is equal to zt and we 
have vi(ztu — z) = v^(u — 1) = m; the other z 7 s contribute 0. 

(b) a > 1. Here z can be written as z' ■ z" where z' runs through the 
i-th primitive roots of 1, and z" through the £ a -th primitive roots of 1. The 
valuation of z — ztu is unless z 1 = Zt, in which case vg{z — ztu) = v^(z" — u). 
It is well-known that Vf(z" — 1) = . Since vi(u — 1) = m, which is 
strictly larger, we have 

v e (z" -u) = v e ((z» - 1) - (u - 1)) = {£ _ = 
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Since the number of the z" is <p{( a ), we thus get ^(^(zju)) = 1, as 
claimed. □ 

We can now prove theorem 4: With the notation of (|5.2.ip . denote by r\ 
the number of j's with dj = t, and by r 2 the number of j's with dj = t-l a i , 
a.j > 1, or a.j = —1 in case £ = 2,t = 2. Using lemmas 5 and 6 we get 

ve(A) < rim + r 2 

and of course 

ncp(t) +^2tp(t-£ a i) <n = dimT. 

Since (p(t-£ a i) > tp(t)(£ - 1) this shows that nip(t) + r 2 ip(t)(£ - 1) < n. 
Hence r\ + r 2 (£ — 1) < [n/tp(t)], and we have: 

vi(A) < rim + r 2 < rim + r 2 {£ — l)m < m[n/cp(t)] , 

which concludes the proof. □ 

Remark. Since (£ — l)m > in all cases (even if £ = 2), the above proof 
shows that vi(A) can be equal to m[n/ip(t)] only when r 2 = 0. In other 
words: 

Complement to theorem 4. Assume Vi(A) = m[n/{p(k)], where 
n = dimT. If g £ T/% is such that x{d) = z tu, with Vi(u — 1) = m 
as above, the characteristic polynomial of p(g) is divisible by (&t) N > with 
N=[n/ip(k)]. 

(In other words, the primitive t-th. roots of unity are eigenvalues of p{g) with 
multiplicity N .) 

When t = 1 or 2 (i.e. when ip(t) = 1), this can be used to determine the 
structure of an "optimal" T: 

Corollary. Assume t = 1 or 2, and vi(A) = mn. Then : 

(i) If t = 1, the torus T is split (i.e. isomorphic to the product of n 
copies of G m ). 

(ii) Ift = 2, T is isomorphic to the product ofn non-split tori of dimension 
1 which are split by the quadratic extension k{z()/k if £ ^ 2 and by k(i)/k 
if £ = 2. 

Proof. We give the proof for t = 2 and £ > 2: the case t = 1 is easier and 
the case t = 2 = £ requires similar, but more detailed, arguments. 

Let 7 6 IV We may write x{l) as e 7 -n 7 , with e 7 G {1,-1} and 
ii 7 £ {1 + £ m 7j(}. There are three cases: 

(a) e 7 = —1 and ^(u 7 — 1) = m 

(b) e 7 = —1 and ^(u 7 — 1) > m 

(c) e 7 = 1. 

In case (a), the "complement" above shows that ^(7) has —1 for eigenvalue 
with multiplicity n, hence p{^f) = —1 in Aut(T) ~ GL n (Z). 

In case (b), choose g £ of type (a); this is possible by the very definition 
of t and m. The element g 2/ y is of type (a) (this uses the fact that £ is odd); 
hence we have p{g 2 'y) = — 1 and since p(g) = — 1 this shows that p{^) = — 1. 

If 7 is of type (c), then 57 is of type (a) or (b) and we have p(g~f) = — 1 
hence p{^) = 1. 
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In all cases, we have ,0(7) € {1,-1}, and more precisely ^(7) = e 7 . The 
corollary follows. □ 

It would be interesting to have a similar classification for t > 2. 

5.3. Proof of theorem 4': construction of tori with large A's. To 

prove th.4' it is enough to construct a fe-torus T, of dimension n = <p(t), such 
that T(k) contains a cyclic subgroup of order £ m . Here is the construction: 
Let K be the field k(z t ) if I ^ 2 and the field k(i) if t = 2. It is a 
cyclic extension of k of degree t with Galois group Ct- Let T\ = R K / k G rn 
be the torus: "multiplicative group of K"; we have T\{k) = K*, and T\{k) 
contains the group {z em }, cf. §4. If a is a generator of Ct, a acts on T\, 
and we have a 1 — 1 = in the ring End(Ti). Let us write the polynomial 
X 1 — 1 as ^t(X)-^(X), where <3?j is the t-th cyclotomic polynomial. We have 
$ t (cr)*(cr) = in End(Ti). Let T = Im*(cr) be the image of 

9(a) : Ti -> Ti. 

One checks that 

(a) dimT = 

(b) T(/c) contains . 

(For £ 7^ 2, (b) follows from the fact that the restriction of *(cj) to (z tm ) is 
an automorphism. For I = 2, use the fact that T is the kernel of $ t (cr).) 
Hence T has the required properties. □ 

Alternate description of T. It is enough to describe its character group 
T* = Hom fes (T, G m ), together with the action of T k on T*\ 

- T* = Z[X]/® t (X) = algebraic integers of the cyclotomic field Q(/t t ) 

- T k acts on T* by T k ->• Imx <00 C t ^ Aut (Q(/x t )). 

(It does not matter which isomorphism of Ct onto Aut (Q one chooses; 
they all give isomorphic tori.) 

5.4. The S-bound for reductive groups. Recall, cf. §3.3, that "reduc- 
tive" => "connected". 

Theorem 5. Let G be a reductive group over k, of rank r, with Weyl group 
W. If A is a finite subgroup of G(k), one has 



vt{A) < m 



+ v t (W). 



Proof. As usual, we may assume that A is an ^-group. In that case it is 
nilpotent, and by the corollary to th.3" of §3.3 there exists a maximal k- 
torus T of G whose normalizer ./V = Nq(T) contains A. Put Wt = N/T; 
this is a finite A:-group such that Wt(^s) — W. If At denotes the intersection 
of A with T(k), we have an exact sequence 

1 — > Ax — > A — > W T {k) . 

Hence ve(A) < V£(At) + By th.4, we have V(,(At) < m - [r/ip(t)]; 

on the other hand Wx(k) is isomorphic to a subgroup of W, hence t^(Wr(A;)) 
< ve(W). The theorem follows. □ 
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Corollary. If r < (p(t), then G(k) is l-torsion free (i.e. does not contain 
any elements of order €) . 



Proof. We have 



0. Hence by th.5 it is enough to show that 



vi(W) = 0, but this follows from th.l of §1.1 since W is isomorphic to 
a subgroup of GL r (Z) and r < tp(t) < t < £ — 1. □ 

Remark. The "S-bound" given by th.5 looks a priori rather coarse: 

(a) The torus T is not an arbitrary torus of dimension r; the fact that it 
is a subtorus of G puts non-trivial conditions on it; for instance the action 
of Tfc on T* = Honi£. s (T, G m ) stabilizes the set of roots. 

(b) The group Wx(k) is in general smaller than W itself, and the image 
of N(k) — >■ Wx(k) may be even smaller. 

It is therefore surprising how often the S-bound is close to being optimal. 
As an example, take k = Q and G of type E%. We have m = 1 and t = t — 1 
(except when t = 2 in which case m = t = 2), r = 8, \W\ = 2 14 3 5 5 2 7. The 
S-bound tells us that, if A is a finite subgroup of G(Q), its order divides the 
number 

M s = 2 30 -3 13 -5 6 -7 5 -13 2 -17-19 - 31. 

We shall see later (cf. §6.3.2 and §7) that the best bound is M = Ms/5-7-17 . 
In particular, the ^-factors of Ms are optimal for all €s except t = 5,7 and 
17. 



§6. The M-bound 

6.1. Notation. From now on, G is a semisimple Q group over k. We denote 
by R its root system (over k s ), by W its Weyl group, and by r its rank. 
The group W has a natural linear representation of degree r. The invari- 
ants of W acting on Q[xi, . . . , x r ] make up a graded polynomial algebra of 
the form Q[Pi, . . . ,P r ], where the Pi are homogeneous of degrees di, with 
d\ < c?2 < • • • < d r , (Shephard-Todd theorem, cf. e.g. [LIE V], §5, th.4 or 
[Se 00], p. 95). The d^s are called the invariant degrees of W (or of G). One 
has 

[Jdi = |W| and J2{2di - 1) = dimG. 

When G is quasi-simple (i.e. when R is irreducible) d r is equal to the Coxeter 
number h = (dimG)/r — 1, and one has the symmetry formula 

di + d r+ i-i = h + 2 . 

Moreover, if j < h is prime to h, then j+1 is one of the d^s. These properties 
make di, . . . , d r very easy to compute (see e.g. the tables of [LIE VI]). 

For instance, for G of type E 8 , the dj's are: 2, 8, 12, 14, 18, 20, 24, 30. 

Let Dyn(ii) be the Dynkin diagram of R. There is a natural action of the 
Galois group on Dyn(i?): this follows from the fact that Dyn(R) can be 
defined intrinsically from G/k s (cf- [LIE VIII], §4, no 4, Scholie, or [SGA 3], 

^We could also accept inner forms of reductive groups, for instance GL n or more 
generally GL_d, where D is a central simple fc-algebra with [D : k] = n 2 . In that case, one 
has r = n, the d;'s are the integers 1,2, ... ,n and th.6 is valid, with the same proof. 
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expose XXIV, §3, p. 344). In what follows (with the only exception of §6.7) 
we make the assumption that the action of on Dyn(i?) is trivial: one 
then says that G is of inner type (it can be obtained from a split group Go 
by a Galois twist coming from the adjoint group of Go). 

Examples of groups of inner type : 

- SL n , or more generally, SL^, where D is a central simple algebra over k. 
-Any group G whose root system has no non-trivial automorphism, e.g. any 
group of type Ai,B r ,C r , G 2 ,F 4 , E 7 , E 8 . 

6.2. Statement of the theorem. We fix £, k, and the root system R of 
G. Recall that Im^oo is a closed subgroup of Z|. Define: 

M(£, k, R) = inf v ^ " X ) = H E v *lX*~ ~ 1) • 

This is either an integer > or oo (it is oo if and only if the invariants m, t 
of k defined in §4 are such that m = oo and t divides one of the d^s, see 
prop. 4 below). 

Theorem 6. Let A be a finite subgroup ofG(k). Then ve(A) < M (£,k, R). 
(Recall that G is semisimple of inner type, cf. §6.1.) 

This is what we call the "M-bound" for vi(A). It will be proved in §6.5 
below by a method similar to Minkowski's. We shall see in Lect. Ill that it 
is "optimal" except possibly in the case £ = 2, type (c) of §4.2. 

For computations, it is useful to write M(£, k, R) explicitly in terms of 
the invariants t and m of §4: 

Proposition 4. (1) If £ / 2 or if £ = 2, t = 1 (case (a)), one has 
M(£,k,R)= (m + v e {di)) 

i 

di= (modi) 

(2) If £ = 2 and t = 2 (cases (b) and (c)) , one has 

M (2, k, R)=n + mr + v 2 {W) , 

where ro (resp. r\) is the number of indices i such that di is even (resp. di 
is odd). 

Proof. Let us begin with the case £ ^ 2. One shows first that, if t\d, one has 
ve(x d — 1) > m + vg(d) for every x £ Imx,™- (This is easy, since x can be 
written as zu with z* = 1 and ve(u — 1) > m, hence x d — 1 = u d — 1.) 

This already shows that M(£,k,R) > J2t\d, ( m + To prove the 

opposite inequality, one chooses x G Im x eoo of the form zu with z of order 
t and V((u— 1) = m. One gets (1). 

The same argument works if £ = 2 and t = 1. If £ = 2 and t = 2, one has 

V2(x d — 1) > m + V2{d) if d is even 
v 2 (x d -l) > 1 if d is odd, 
for every x £ Im x 2DO • This gives: 

M(2,k,R)> ^ 1+ E ( m + v 2(rfi)) =ri + mr + w 2 (W). 

odd even 
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To get the opposite inequality, observe that x = — 1 + 2 m belongs to Imx 2CO 
and check that Yl v^{x 1 — 1) is equal to r\ + mr§ + V2(W) . □ 

Corollary. Let a(t) be the number of indices i such that di = (mod t). If 
a{t) = 0, then G(k) is i-torsion free. 

Indeed, if a(t) = 0, the sum occurring in prop. 4 is an empty sum, hence 
M(£, k, R) = and one applies th.6. □ 

6.3. Two examples: A\ and E%. We take k = Q, so that t = £ — 1 and 
m = 1 if £ > 2 and t = m = 2 if I = 2. 



6.3.1. Type A\. There is only one di, namely d\ = 2, and prop. 4 gives: 

'3 if £ = 2 
M(4Q,Ai) = < 1 if £ = 3 
if £ > 3 . 

V 

In other words, every finite subgroup of G(Q) has an order which divides 
2 3 -3. This bound is optimal in the following sense: 

(a) The split adjoint group PGL2(Q) contains both a subgroup of order 
3 and a dihedral subgroup of order 8 (but no subgroup of order 24) . 

(b) The simply connected group SLh(Q), where H is the standard quater- 
nion division algebra, contains a subgroup of order 24 which is isomorphic to 
SL2(Fa). However the split group SL^Q) does not contain any subgroup 
of order 8 (but it does contain cyclic subgroups of order 3 and 4). 

6.3.2. Type E 8 . If we define M{Q,E S ) as \[ f £ MM ' Eli) , prop.4 gives: 

M(Q,E S ) = 2 30 -3 13 -5 5 -7 4 -ll 2 -13 2 -19-31, see e.g. [Se 79], §3.3. 

By th.6, the order of every finite subgroup of G(Q) divides M(Q,E$). As 
we shall see in the next lecture, this multiplicative bound is optimal. 



6.4. A Chebot are v- style result. We need such a result in order to gen- 
eralize Minkowski's method of §1. 

Let L be a normal domain which is finitely generated over Z as a ring, 
and let k be its field of fractions. If d = dim(L) denotes the Krull dimension 
of L ([AC VIII], §1), one has (loc.cit., §2): 

d = 1 + tr.deg(fc/Q) if char(fc) = 

d = tr.deg(/c/F p ) if char(fe) = p>0. 

Let Specmax(L) be the set of the maximal ideals of L (= set of closed 
points of Spec(L)). If x G Specmax(L), the residue field k(x) = L/x is finite 
(see e.g. [AC V], p. 68, cor. 1). We put Nx = \k(x)\; it is the norm of x. 

When d = 0, L is a finite field, and Specmax(L) has only one element. If 
d > (e.g. when char (A;) = 0), then Specmax(L) is infinite. More precisely, 
the Dirichlet series z(s) = J2 x l/(Nx) s converges for Re(s) > d, and one 
has 



(6.4.1) z(s) ~ log (l/(s - d)) when s^d (with s > d) . 
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See [Se 65], §2.7, which only contains a sketch of proof; complete details 
(for a slightly weaker statement) can be found in [Pi 97], App. B 0; sec 
also [FW 84], pp.206-207. 

Let now n be an integer > 1 which is invertible in L (and hence in k). 
Let Xn '■ Tfc — > (Z/nZ)* denote the n-th cyclotomic character of k, cf. §4.0. 
As in §4, we shall be interested in Imxn C (Z/raZ)*. 

Theorem 7. Let c be an element of (Z/nZ)* ; and let X c be the set of all 
x £ Specmax(L) such that Nx = c (mod n). Then : 

a) If c ^ Im Xn, then X c = . 

b) If c & Im^n and d > 0, then X c is Zariski-dense in Specmax(L) (or 
in Spec(L), this amounts to the same). In particular, X c is infinite. 

A more concrete formulation of b) is that, for every non-zero / € L, there 
exists an x with / x and Nx = c (mod n). 

Example. Take L = Z[l/n]. Then Specmax(L) is the set of all prime 
numbers which do not divide n, and th.7 translates into Dirichlet's theorem 
on the existence of primes in arithmetic progressions. 

Proof of theorem 7. The group C = Im Xn is the Galois group of the cyclo- 
tomic extension k(z n )/k. Let L n be the integral closure of L in k{z n ). One 
checks by standard arguments that the ring extension L n /L is finite and 
etale. In geometric terms, Spec(L n ) — > Spec(L) is a finite etale covering. 
The group C acts freely on Spec(L„), with quotient Spec(L). For every 
closed point x of Spec(L), the Frobenius element cr x of x is a well-defined 
conjugacy class of C (hence an element of C since C is commutative). More- 
over, if we view C as a subgroup of (Z/nZ)*, cr x is the image of Nx in Z/nZ. 
This proves a). 

Suppose now that d > and that c belongs to C = Imxn- Let z c (s) be 
the Dirichlet series ^ l/(Nx) s , where the sum is over the elements x of X c . 
The general Chebotarev density theorem ([Se 65], [Pi 97]) gives: 

(6.4.2) z c (s) ~ t— 7 log(l/(s — d)) when s — > d with s>d. 

I I 

In particular, we have z c (d) = +oo. If the Zariski closure X c of X c were of 
dimension < d— 1, we would have z c {d) < oo, as one sees by splitting X c 
into irreducible components, and applying (6.4.1). Hence b). □ 

6.5. Proof of theorem 6. Let A C G(k) be as in th.6. We want to prove 
that 

v e (A) < M(£,k,R). 

We do it in three steps: 

2 When char(fc) = one can give a stronger statement, in the spirit of the Prime 
Number Theorem: 

For every X > 2, call ttl(X) the number of a; £ Specmax(L) such that Nx < X. Then: 

ttl(X) = (l/d)X d /logX + 0(X d /log 2 X) when X -> oo . 

The general Chebotarev density theorem can also be stated (and proved) in terms of such 
"natural" density (standard method: use Weil-Deligne estimates to reduce everything to 
the known case d — 1). 
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6.5.1. The case where k is finite. Put q = \k\. It is well-known that 

\G(k)\=q N ]l(q li -l), where N = \R\/2 = Y,(^-l)- 

This shows that v^{A) < ^2vi(q d ' — 1). The integer q, viewed as an element 
of Z| , is a topological generator of Im x eoa ■ Hence every element u of Im x e oo 
is an ^-adic limit of powers of q and this implies that ve(u d — 1) > ve(q d — 1) 
for every d > 1. Hence the lower bound which defines M(£, k, R) is equal to 
^2ve(q dl — 1); this proves th.6 in the case where k is finite. 

6.5.2. The case where k is finitely generated over its prime subfield. 

By 6.5.1, we may assume that k is infinite. We need a subring L of k, with 
field of fractions k, which has the following properties: 

(a) L is normal, finitely generated over Z and contains l/l. 

(b) G comes by base change from a semisimple group scheme G over L, 
in the sense of [SGA 3], XIX. 2.7. 

(c) A is contained in the group G(L) of the L-points of G. 

Lemma 7. There exists such an L. 

This is standard, see e.g. [EGA IV], §8.1 □ 

Let us now choose (L,G) with properties (a), (b) and (c). For every 
x G Specmax(L), the fiber G^. of G at x is a semisimple group over k(x), 
of type R. Moreover, the Dynkin diagram of G is finite etale over Spec(L), 
cf. [SGA 3], XXIV.3.2; since it is "constant" for the generic fiber (i.e. 
over k) it is constant everywhere; this shows that the G^. are of inner 
type. The inclusion map i : A — > G(L) gives for every x a homomorphism 
i x : A — > G(k(x)). Since i is injective, there is an open dense subset Xq of 
Specmax(L) such that i x is injective for all x £ Xq. We thus get: 

v e (A) < v e (G(K(x))) = J2v e ((Nx) dt - l) for all x (£ X , 

cf. 6.5.1. Let u be any element of Im^™ • By applying th.7 to the image of 
u in (Z/FZ)* with j = 1, 2, . . . , we find a sequence of points Xj of Xq such 
that lim Nxj = u in Z^ . We have: 

^'-l)=limV^((iVx/-l), 

J— >oo * — » 

and applying the formula above to each of the x^-'s we obtain 

ve(A) < ^2v e (u dl - 1) for every u 6 Imx <00 . 

This proves th.6 in the case 6.5.2. 

[Variant: One reduces the general case to the case where dim(L) = 1 by using 
Hilbcrt's irreducibility theorem, as explained in [Sc 81], p. 2; in the case dim(L) = 1, 
one can apply the standard Chebotarev theorem instead of the general one.] 

6.5.3. The general case. The same argument as for lemma 7 shows that 
G comes by base change from a semisimple group G' over a subfield k! 
of k which is finitely generated over the prime subfield of k (i.e. F p or 
Q). Moreover, one may assume (after enlarging k! if necessary) that A 
is contained in G'(k'). The Galois group Ty acts on the Dynkin diagram 
Dyn(i?) of G' (which is the same as the one of G). Let k" be the Galois 
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extension of k' corresponding to the kernel of IV — > Aut Dyn(i?). Since G 
is of inner type over k, the field k" is contained in k. By base change to k", 
G' gives a semisimple group G" which is of inner type and we may apply 
6.5.2 to (G",A). We get v e (A) < M(£,k",R). Since k" is contained in k, 
we have M(£,k",R) < M(£,k,R) : the group Imx^oo can only decrease by 
field extensions. Hence vz(A) < M(£,k,R). □ 

6.5.4. Remark. Surprisingly, the proof above does not really use the hy- 
pothesis that A is a subgroup of G(k). It uses only that A acts freely on 
G, viewed merely as a k- variety (and not as a homogeneous space); this is 
indeed enough to ensure that V((A) < v^(G(k)) when k is finite. Here is an 
example: take G = SL2, I = 2, k = Q; the M-bound is 3, which means that 
any finite 2-subgroup of SL2(Q) has order < 8. As was said in §6.3.1, there 
is in fact no subgroup of order 8 in SL2(Q). But one can make a cyclic 
group of order 8 act freely on the variety SL2: take for instance the group 
generated by the automorphism: 

'a b\ fd-c -c-d\ fO -l\ fa b\ f 1 T 

c 



d) ^ li^) (^1 I = U J [c dl[-l 1 



Hence, even in this bad-looking case, the M-bound can claim to be "op- 
timal" . 

6.6. An analogue of Sylow's theorem. 

Theorem 8. Let A and A 1 be two finite £-subgroups of G(k). Assume that 
vi(A) is equal to the M-bound M(£,k,R). Then there exists y € G(k) such 
that yA'y' 1 C A. 

Corollary. If both A and A' attain the M-bound, then they are geometrically 
conjugate (i.e. conjugate in G(k)). In particular, they are isomorphic. 

Proof. We may assume that k is finitely generated over its prime subfield. 
If it is finite, th.8 is just a special case of Sylow's theorem. Let us assume 
that k is infinite, and choose L, G as in §6.5.2 with A, A' C G(L). Let Y be 
the subscheme of G made up of the points y with yA'y^ 1 C A. Let X be 
the set of all x G Specmax(L) such that Nx, viewed as an element of Z», is 
of the form ztu with z t of order t and v^(u— 1) = m (note that m is finite, 
cf. §4.3). It follows from th.7, applied to n = £ m+1 , that X is Zariski-dense 
in Spec(L). If x € Specmax(L), the groups A and A' inject into G(k(x)) 
(this is an easy consequence of the hypothesis that £ is invertible in L). If 
moreover x belongs to X, then the same computation as in §5.2 shows that 
ve(G(n(x)) is equal to the M-bound, hence A is an £-Sylow of G(k(x)). By 
Sylow's theorem, this shows that A' is conjugate in G(k(x)) to a subgroup 
of A. In particular, the fiber at x of Y — > Spec(L) is non-empty. Since X 
is Zariski-dense, this implies that the generic fiber Y/ k of Y — s- Spec(L) is 
non-empty, i.e. that Y(k) is non-empty. □ 

Remark. One can show that Y is smooth over L, and hence that Y(k s ) / 
which is slightly more precise than Y(k) / 0. 



Exercise. Show that a family of polynomial equations with coefficients in Z has a 
solution in C if and only if it has a solution in Z/pZ for infinitely many p's. 
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6.7. Arbitrary semisimple algebraic groups. In the previous sections, 
we have assumed that G is of inner type, i.e. that the natural homomorphism 

e : T k -> Aut Dyn(P) 

is trivial. Let us now look briefly at the general case, where no hypotheses 
on e are made. In order to state the result which replaces th.6 we need to 
introduce the linear representations £d of T k defined as follows: 

Let S = Q[Pi, . . . , P r ] be the Q-algebra of VF-invariant polynomials, cf. 
§6.1. Let I = (Pi, ... ,P r ) be the augmentation ideal of S; put V = I/I 2 . 
The vector space V is of dimension r, and is graded; the dimension of its 
d-th component Vd is equal to the number of indices i with di = d. The 
group Aut Dyn(P) acts on S, V and the V^'s; by composing this action with 
e, we get for each d > a linear representation 

e d : T k -> Aut(Vd) . 

Theorem 6'. Let A be a Unite subgroup of G(k). Them 

ve(A)< inf YsMdetix^igf-Edig))) 
a e i k ~r 

d 

(The determinant is relative to the vector space Vd<8)Qe.) 

Proof (sketch). The method is the same as the one used for th.6. There are 
three steps: 

(1) Reduction to the case where k is finitely generated over its prime 
subfield; this is easy. 

(2) Reduction to the case where k is finite, via the general Chebotarev 
density theorem instead of th.7. 

(3) The case where k is finite. In that case, if q = \k\, and if a is the 
Frobenius generator of T^, one has (cf. e.g. [St 68] th. 11.16) 

vi (G(k)) = X>( det(q d - Ed(cj))) = £>( det(x^ (<j) d - e d (a))) 
d d 

hence the desired formula: 

(*) v e (A) < Mtet(x e ~ (g) d - e d (g))) 

d 

in the special case g = a. By applying this to the finite extensions of k, one 
sees that the inequality (*) is valid for all a™, n = 1, 2, . . . , and hence for all 
g € Tk, since the a n are dense in Tfc. □ 

Remark. One may also prove th.6' using ^-adic cohomology, cf. §6.8. 

Example. Take R of type A%, so that Aut Dyn(P) = {1, —1} and e may 
be viewed as a quadratic character of T^. The V^'s are of dimension 1 for 
d = 2, 3 and are otherwise. The action of Aut Dyn(P) on Vd is trivial for 
all d, except d = 3. Hence £2 = 1, £3 = £, and th.6' can be rewritten as: 

v e (A) < inf {v e ( Xeoc (g) 2 - 1) + v e (x tpo (gf - e(g)) } . 
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A similar result holds for the types A r (r > 2), D r (r odd) and Eq, with 2 
(resp. 3) replaced by the even c/j's (resp. the odd dj's). 

6.8. The cohomological method. Let us consider first the general sit- 
uation suggested in §6.5.4 where a finite group A acts freely on a quasi- 
projective k- variety X. As explained in [II 05], §7, one can then give an 
upper bound for vg(A) in terms of the action of on the etale cohomology 
of X. More precisely, let H l c (X) denote the i-th etale cohomology group 
of Xij ts , with proper support and coefficients Q^; it is a finite dimensional 
Q^-vector space which is for i > 2-dim(A). There is a natural action of 
Tk on Hl(X), and, for each g € Tk, one can define the "Lefschetz number" 
Ax (g) by the usual formula: 

AxG?) = J>iyTr( 5 |i^(A)) . 

i 

One has Ax{g) € Zg. Moreover: 
Theorem 6". v e (A) < mf ge r k v e (A x (g)) . 

Proof. See [II 05], §7, especially cor. 7.5. The proof follows the same pattern 
as the other proofs of the present §: one uses Chebotarev to reduce to the 
case where k is finite, in which case the result follows from the fact, due to 
Grothendieck, that, if a is the (geometric) Frobenius generator of T^, then 
Ax(cr) is equal to |A(fc)|, hence is divisible by \A\ since the action of A is 
free. (As in the proof of th.6', one applies this, not only to a but also to its 
powers a n , n > 0, and one uses the fact that the a n are dense in T/, .) □ 

If one applies th.6" to A C G(k), with A acting by left translations on 
X = G, one recovers th.6 and th.6', thanks to the known structure of the 
cohomology of G, cf. e.g. [SGA 4±], p. 230. 

6.9. The Cremona group: open problems. Recall that the Cremona 
group Cr r (k) is the group of A:- automorphisms of the field k(X\, . . . , X r ), i.e. 
the group of birational automorphisms (or "pseudo-automorphisms" , cf. [De 
70]) of the projective r-space over k. For r = 1, one has Cri(fc) = PGL2(&). 
Let us assume that r > 2. As explained in [De 70], Cr r is not an algebraic 
group, but looks like a kind of very large semisimple group of rank r (very 
large indeed: its "Weyl group" is the infinite group GL r (Z)). Not much is 
known about the finite subgroups of Cr r (k) beyond the classical case r = 2 
and k algebraically closed. Here is a question suggested by §5.1: 

- Is it true that Cr r (fc) has no ^-torsion if ip{t) > r? 
In the special case k = Q, r = 2 or 3, this amounts to: 

- Is it true that the fields Q(Xi,X2) and Q(Xi,X2,Xs) have no auto- 
morphism of prime order > 11? (Automorphisms of order 2, 3, 5 and 7 do 
exist.) 

It would be very interesting to attack these questions using cohomology, 
but I do not see how to do this. It is not even clear how to define coho- 
mological invariants of Cr r (C), since there is no natural topology on that 
group. Still, one would like to give a meaning to a sentence such as 

"Cr r (C) is connected for r > 1 and simply-connected for r > 2." 
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III. Construction of large subgroups 
§7. Statements 

We keep the notation of Lecture II: k, £, x e ao , t, m, .... We consider only 
semisimple groups over k with a root system R which is irreducible. The 
M-bound of §6.2 will be denoted by M(£, k, R); it only depends on the pair 
{£, k) via the invariants t and m, and on R via the degrees di, . . . , d r of W. 
We limit ourselves to the case m < oo; see §14 for the case m = oo. 

A pair (G, A), where G is of inner type with root system R, and A C G(k) 
is a finite group, will be called optimal if Vi(A) is equal to the M-bound 
M(£,k,R). (We could assume that A is an ^-group, but this would not be 
convenient for the constructions which follow.) Our goal is to prove: 

Theorem 9. If £ ^ 2, an optimal pair (G,A) exists (for any k, R). 

Theorem 10. If £ = 2, an optimal pair (G, A) exists i/Imx 2 oo is of type 
(a) or (b) in the sense o/§4.2 (i.e. if lmx 200 can be topologically generated 
by one element). 

Theorem 11. In the case 1 = 2 and type (c), there exists (G,A) with 

v 2 (A) = r m + v 2 (W) 
where ro is the number of indices i such that di is even. 

Note that here the M-bound is M(2, k,R) = r\ + r^m + V2(W) with 
7T = r — ro, cf. §6.2, prop. 4. Hence v 2 (A) differs from M(2,k,R) only by 
r\. In particular, A is optimal if r\ = 0. Hence: 

Corollary. // all the di's are even (i.e. if — 1 G W), then an optimal pair 
(G, A) exists for £ = 2 (and hence for all ^'s, thanks to th.9). 

This applies in particular to the exceptional types G 2 , F4, Ej and E%. 

Remarks. (1) The simplest case where the M-bound is not attained is k = Q, 
£ = 2, R of type A 2 , where m = 2, ro = 1, r = 2, the M-bound is 4, and it 
follows from [Sch 05] that v 2 (A) < 3 for every finite subgroup A of G(Q). 

(2) In Theorems 9, 10 and 11, no claim is made on the structure of G 
except that it is of inner type and that its root system is of type R. However, 
if one looks closely at the proofs given in the next sections, one sees that G 
can be chosen to have the following properties: 

- it is simply connected; 

- it splits over the cyclotomic field k(ze) if £ > 2, and over k(i) if £ = 2. 

Simple examples (such as k = Q, £ = 3, G of type G 2 ) show that it is not 
always possible to have G split over k. 

(3) If G is not chosen carefully, the group G(k) may not contain any large 
^-subgroup, even if k contains all the roots of unity. For instance, when R 
is of type A\ (resp. of type Eg) it is easy (resp. it is possible) to construct 
a pair (G, k) such that the only torsion elements of G(k) have order 1 or 2 
(resp. G(k) is torsion free). 

(4) The three theorems above are almost obvious if the characteristic is 
p / (especially Theorem 11 since type (c) never occurs!): one takes a 
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finite field ko contained in k which has the same invariants t and m (this 
is easily seen to be possible - if k is finitely generated over F p , one chooses 
the maximal finite subfield of k), and one takes for G the group deduced by 
base change from a split group Go over ko with root system R. If we choose 
for A the finite group Go(ko), it is clear from the way we got the M-bound 
that vi(A) = M(£, fc , R) = M(£, k, R), so that (G, A) is optimal. 

In what follows, we shall assume that char(fe) = 0. Note also that we 
could replace k by any subfield having the same invariants t and m, for 
instance the intersection of k with the field of £°°-roots of unity. We could 
thus assume that k is a cyclotomic number field, if needed. 

The proof of Theorem 9 will be given first for classical groups (§9), by 
explicit elementary constructions similar to those of Schur. The more inter- 
esting case of exceptional groups (§12) will use different methods, based on 
Galois twists (§10), Tits groups and braid groups (§11). The case of I = 2 
will be given in §13. The last section (§14) is about m = oo. 

§8. Arithmetic methods (k = Q) 

These methods are not strong enough to prove the statements of §7, but 
they give very interesting special cases. 

8.10. Euler characteristics. Here, the ground field is Q. One starts 
from a split simply connected group scheme G over Z (this makes sense, 
cf. |SGA 3j ). One may thus speak of the group T = G(Z) of the integral 
points of G. It is a discrete subgroup of G(R). Its Euler characteristic xCO 
( "caracteristique d'Euler-Poincare" in French) is well-defined (see [Se 71] 
and [Se 79]); it is a rational number. Moreover it is proved in [Ha 71] that 

(8.10.1) x{ r) = cf[\((l-d l ) = cfl^ 1 

i=l i=l 1 

where is the d-th Bernoulli number, £ is the zeta function and 
c= |W|/|Wk-| where Wk is the Weyl group of a maximal compact sub- 
group K of G(R). Assume that all di's are even (if not, all the terms in 
(|8.10.ip are zero). Using standard properties of Bernoulli numbers, one can 
check that the M-bound relative to i is M = vt( den (|C(1 ~~ <^))) ; where 
"den" means denominator. Hence, if I does not divide c, and does not di- 
vide the numerator of any |C(1 — di) (which is the case if i is a so-called 
regular prime), one sees that the denominator o/EP(r) is divisible by £ M . 
But a theorem of K. Brown [Br 74] shows that this is only possible if T con- 
tains a finite subgroup of order £ M . Hence we get an optimal pair (provided 
(c, I) = 1, and I is regular, say). 

Example. Take G of type Eg; here c = 3 3 -5, and the numerators of the 
|C(1 — di) do not cancel any denominator. Hence one obtains that a split 
Eg, contains an optimal A for all £ ^ 3,5, with the extra information that 
A can be found inside the group T = G(Z) - but no information on what it 
looks like! 
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8.11. Mass formulae. In [Gr 96j . B. Gross considers Q-forms of G such 
that G(R) is compact] he also requires another condition which guarantees 
that G has a smooth model over Z. This condition is fulfilled for types B, 
D, G2, F4 and E%. He then proves a mass formula a la Minkowski ( [Gr 96| . 
prop. 2. 2): 

E ^ = 11^(1-4) 

i=i 

where the A a are the Z-points of the smooth models of G over Z (taken up 
to conjugation). Each A a is finite. It is then clear that, if £ N is the £-th 
part of the denominator of Yii=i ^C(l ~~ (k), the ^-Sylow subgroup of one of 
the A a has order > £ N . If is equal to the Minkowski bound M (which 
happens if £ does not divide the numerator of any of the ^C(l ~~ di)), then 
such a Sylow has order £ M , and we get an optimal pair. Note that there is 
no extra factor "c" as in (j8.10.ip . This works very well for G2, F4, Eg (and 
some classical groups too, cf. |Gr 96 ]): 

G2 - Here the mass is \ ({— 1)C( — 5) = ^6337' ano ~ ^ * s obtained with just 
one A a , which turns out to be isomorphic to G2(F2). 

F4 - There are two A a 's and the mass formula is 

' + oi, J 7-0 = 4C(-i)C(-5)C(-7)C(-H) 



2 15 -3 6 -5 2 -7 2 12 -3 5 -7 2 -13 



16 

691 



2 i5 . 36 . 52 . 72 . 13 ■ 



Eg - Here the numerator is very large, but the denominator is exactly what 
is needed for the M-bound, namely: 

2 30 . 313 . 55 . 7 4 . 11 2 . 13 2 . ig . 31 _ 



§9. Proof of theorem 9 for classical groups 

Here £ ^ 2. Recall that Imx f0 o = C t x {1 + ^ m Z £ }, where m > 1 and t 
divides i — 1. The M-bound is 

M= J2 {m + v £ (di)). 

i 

di=0(mod t) 

We denote by K the field k(z() generated by a root of unity of order I. It 
is a cyclic extension of k, of degree t, with Galois group Ct- It contains z e , n 
but not z , 1 , cf. §4.1. 

£771+1 " O 

9.1. The groups An and A^. If is an integer > 1, we denote by A tv 
the subgroup of GLn(K) (where K = k(zg) as above) generated by the 
symmetric group Sn and the diagonal matrices whose entries are ^ m -th 
roots of unity (wreath product of Sn with a cyclic group of order £ m ). We 
have 



(9.1.1) 



v e (A N ) = mN + v e (N\). 
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The image of detx : An — > K* is {±1} x (z em ). Hence the kernel ^4^- is 
such that 

(9.1.2) v i (Aj f ) = m(N-l)+v i (N\). 

We are going to use An, and sometimes ^4^, in order to construct optimal 
subgroups for the classical groups SL„, SO„ and Sp n ; this is what Schur 
did in |Sch 05| . §6, for the case of GL„. 

9.2. The case of SL„. The d^s are 2, 3,. . ., n. If we put N = we have 

(9.2.1) M = mN + Vi(Nl) if*>2, 

(9.2.2) M = m(N - 1) + v e (N\) if t = 1, in which case N = n. 

In the case t > 2, we take An C G~Ln(K) C G~LNt(k), and observe that 
detfc(^4/v) is equal to ±1 (indeed, if g E An, then det&(<?) = N K / k (detK(9)) 
and one checks that N K / k (z pn ) = 1). This shows that an ^-Sylow of An is 
contained in ShNt(k) and hence in SL„(fc). By (|9.2.ip we get an optimal 
pair. 

In the case t = 1, we use the same construction with ^4^ instead of An- 
The comparison of (|9,1.2p and (|9.2.2p shows that we get an optimal pair. 

9.3. The case of the orthogonal and symplectic groups, t odd. Let 

us consider the case of Sp 2n . The d^s are equal to 2, 4, ... , In. Hence, if we 
put JV = [2] , the M-bound is mN + vg(N\). There is a natural embedding: 

GLat ->■ Sp 2N -)■ Sp 2n 

( x \ 

defined by x i-> I ^ t j J . The image of A^v by that embedding is optimal. 

The same construction works for S02n and SO^n+i- (Note that, in all 
these cases, we get the split forms of the groups of type B n , C n , D n . This 
is no longer true in the case t is even - nor in the cases of §12.) 

9.4. The case of the orthogonal and symplectic groups, t even. 

Since t is even, the group C% = Gal(K/k) contains an element a of order 
2; its image in ZJ is —1. Let Kq be the subfield of K fixed by a; we have 
[K : Kq] = 2, [Kq : k] = to with to = t/2. Moreover a(z em ) is equal to 
(^m) _1 ; i.e. a acts on z im just as complex conjugation does. Let us define 
an hermitian form h on K N (where N is a given integer > 1) by the standard 
formula 

N 

K x ,y) = ^ J x i -a(y i ), if x = (xi, . . .,x N ), V = {yi, ■ ■ -,Vn)- 

i=l 

If Ujv denotes the unitary group associated with h, it is clear that the group 
An defined in §9.1 is contained in XJn{K). [We use here the traditional 
notation XJn{K) for the unitary group; this is a bit misleading, since is 
an algebraic group over Kq, and we are taking its -Ko-points.] 

Let 5 € K* be such that a (6) = —5, e.g. 5 = Z£ — zj l . We have 
K = Kq © S-Kq, and h(x, y) can be decomposed as 

h(x, y) = qo{x, y) +5 ■ b (x, y), with q (x, y) G Kq, b (x, y) E K . 
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Then qo (resp. 60) is a non-degenerate symmetric (resp. alternating) K$- 
bilinear form of rank 2N . 

Its trace q = Tr Ko / k qo(x, y) (resp. b = Tix /k bo(x, y)) is of rank 2Nt$ = Nt 
over k. We thus get embeddings: 

(9.4.1) A N -»• V N (K) -> SO 2N (K ) -> SOrf) 

(9.4.2) Aiv Ujv(if) -> Sp 2JV (^ ) -> S Pjvt (fc). 

Now, for a given n, let us define iV by A?" = [^p] = 
an embedding 

Aat -> Sp^/c) -> Sp 2n (A;) ) 

and one checks that it is optimal. 

The same method gives an embedding of An into SOm(k), hence into 
SC>2n+i(&)) and this embedding is also optimal. As for S02 n (/c), one has to 
be more careful. The method does give an embedding of An into the S02« 
group relative to some quadratic form Q, but we have to ensure that such 
an SC>2n group is of inner type i.e. that disc(Q) = (-l) n in k*/k* 2 . There 
are three cases: 

a) If 2n > Nt (i.e. if t does not divide 2n), we choose Q = q ® qi, where 
qi has rank 2n — Nt, and is such that disc(g) • disc(gi) = (— l) n . We then 
have An C S0 2n) g(fc) and this is optimal. 

b) If 2n = Nt and N is even, we have disc(g) = d , where d = disc(ATo/fc), 
hence disc(q)= 1 in k*/k* 2 , which is the same as (— l) n since n is even. 

c) If 2n = Nt and N is odd, we use an optimal subgroup A of S0 2 „_i(fc) 
relative to a quadratic form qo of rank 2n — 1. By adding to go a suitable qua- 
dratic form of rank 1, we get a quadratic form of rank 2n and discriminant 
(— l) n , as wanted. The corresponding embedding 

A^S0 2 „-i(fc)^SCMA:) 

is optimal. (Note that the d^s for type D n are 2, 4, . . . , 2n — 2, and n. Hence, 
if i /fn, the M-bound for D n is the same as the M-bound for £?„_i.) 

§10. Galois twists 

To handle exceptional groups, we have to use twisted inner forms instead 
of split ones. We shall only need the most elementary case of twisting, 
namely the one coming from a homomorphism ip : — > Aut(G). Let us 
recall what this means (cf. for example [Se 64] . chapter III): 

Let K/k be a finite Galois extension. Let X be an algebraic variety over k, 
assumed to be quasi-projective (the case where X is affine would be enough). 
Choose a homomorphism 

tp : Gal(K/k) -> Aut fe X. 

The twist X v of X by 92 is a variety over k which can be characterized as 
follows: 

There is a A'-isomorphism Q:Xj^ — > X V ^ K such that ~f(0) = o 93(7) for 
every 7 € Gal(K/k). 



to 



By (9.4.2), we get 
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(Here X/ K denotes the K- variety deduced from X by the base change 
k — ?> K, and 99(7) € Aut^ X is viewed as belonging to Autx X/ K .) 

One shows (as a special case of Galois descent) that such a pair (X^,#) 
exists, and is unique, up to isomorphism. 

It is sometimes convenient to identify the iT-points of X and X^ via the 
isomorphism 9. But one should note that this is not compatible with the 
natural action of Gal(K/k) on X(K) and X^K); one has 

7(0(x)) = ¥>(7)(7(a;)) if 7 € Gal{K/k),x € X{K). 

In other words, if we identify X V {K) with X(K), an element 7 of Gal(ivT/A;) 
acts on Xp(K) by the twisted action : 

x ^ ^(7) (7(2)) 

In particular, the k-rational points of X^ correspond (via 6~ l ) to the points 
x € X(K) such that 7(2:) = v?(7 _1 )x for every 7 E Gal(i^//c). 

In what follows we apply the 92-twist to X = split form of G, with ^(7) 
being a k- automorphism of G for every 7 € Gal(i^/fc). In that case, G<p is 
a fc-form of G\ this form is inner if all 93(7) belong to G ad (-KT) where G ad is 
the adjoint group of G. The effect of the twist is to make /c-rational some 
elements of G which were not. In order to define (p, we shall have to use the 
/^-automorphisms of G provided by the Tits group W*, see next section. 

§11. A general construction 

Here, G is a split simply connected group over k, and T is a maximal split 
torus of G. We put N = N G (T) and W = N/T is the Weyl group. 

11.1. The Tits group. The exact sequence 1— > T ^ N — > W — > I does 
not split in general. However Tits ([Ti 66aj . [TT 66b]) has shown how to 
construct a subgroup^ W* of N(k) having the following properties: 

(1) The map W* — > W is surjective. 

(2) The group W* D T is equal to the subgroup T2 of T made up of the 
points x of T with x 2 = 1. 

We thus have a commutative diagram, where the vertical maps are inclu- 
sions: 

1 — > T 2 — » W* — > — > 1 

4- 4" 4" 4" 
l^r^N w 1 

We refer to Tits (loc. cit.) and to Bourbak^ ([LIE X], pp. 115-116, exerc. 
12, 13) for the construction and the properties of W*. For instance: 

If G comes from a split group scheme G over Z, then W* is equal to iV(Z), 
the group of integral points of the group scheme A r . 



The construction of W depends on more than (G,T): one needs a pinning 
("epinglage") of (G, T) in the sense of |SGA 3j . XXIII. 1.1. 

4 Bourbaki works in the context of compact real Lie groups; his results can easily be 
translated to the algebraic setting we use here. 
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In the case of SL n , this means that one can choose for W* the group of 
monomial matrices with non-zero entries ±1 and determinant 1. For n = 2, 

W* is the cyclic group of order 4 generated by 

Note also that W* is a quotient of the braid group associated to W . 
(For the definition of the braid group of a Coxeter group, see e.g. I>.\1 97] .) 

11.2. Special elements of W. We now go back to our general notation 
l,m,t,... of Lecture II. Recall that the M-bound M = M(£, k, R) is given 

by ' 

(11.2.1) M = ^2(m + v £ (di)) , cf. §6.2. 

t\di 

Let a(t) be the number of indices i such that di = (mod t). We may rewrite 
(fTTXTT) as 

(11.2.2) M = ma(t) + ^ v £ (di). 

t\di 

Note that, if no di is divisible by t, we have M = and the trivial group 
A = 1 is optimal. Hence we shall assume in what follows that a{t) > 1. 

Let now w be an element of W. We shall say that w is special (with 
respect to t and €) if it has the following four properties: 

(1) w has order t in W. 

(2) w is the image of an element w* of W* such that (w*)* G T 2 n C(G), 
where C(G) is the center of G. 

(3) The characteristic polynomial of w (in the natural r-dimensional rep- 
resentation of W) is divisible by ($t) a ^, where <]?£ is the i-th cyclotomic 
polynomial. 

(Equivalently: if 2^ denotes a primitive t-th root of unity, then z% is an 
eigenvalue of w of multiplicity at least a(t).) 

(4) Let Cw(w) be the centralizer of w in IF. Then: 

(C w (w)) >J2 v i(di)- 

t\di 

Remark. The reader may wonder whether special elements exist for a given 
pair (t,£) (with a(t) > and 1 = 1 (modi), of course). The answer is 
"no" in general: if R is of type C3 and t = 4, no element of W* has both 
properties (1) and (2). Fortunately, the answer is "yes" for the exceptional 
types G 2 ,...,-E , 8 , cf. §12. 

Example : the regular case. Suppose that w £ W is regular of order t in the 
sense of Springei|j ( |Sp 74| , bottom of p. 170 - see also |BM 97] . §3). This 
means that w has an eigenvector v, with eigenvalue Zt, such that v does not 
belong to any reflecting hyperplane. Then w is special (for any t with 1 = 1 
(mod t)). Indeed: 

^With a slight difference: Springer requires t > 1 and we don't; it is convenient to view 
w = 1 as a regular element of W. 

Note that, if t is given, there is a very simple criterion ensuring the existence of a regular 
element of W of order t: the number of indices i such that di = (mod t) should be equal 
to the number of i's such that di = 2 (mod t), cf. Lehrer-Springer |LS 99] . cor. 5. 5. 
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(1) is obvious. 

(2) follows from the fact, proved in [B M 97] . §3, that w has a lifting w 
in the braid group B^y with w* = 7r, where tt has an image tt in W which 
belongs to T 2 D C(G). In Bourbaki's notation ([LIE X], p. 116) tt is the 
canonical element zq of the center of G. 

(3) is proved in Sp 74 , th. 4.2. 

, th. 4.2, in the stronger form |Cjy(W)| = Y\ t \ di d%. 



(4) is proved in 



Sp 74 



Special cases 

t = 1. Here w = 1 and w* = tt (one could also take w* = 1). 

t =2. Here w = wq = longest element of W . When —1 belongs to W, 
one has u>o = — 1 and Wq is central in W* (because wo is central in By/, cf. 
[BM 97| . 1.2 and 3.4). In that case the inner automorphism of G defined by 
Wq is a "Weyl-Chevalley involution": it acts on T by t H- 1 . 

11.3. An auxiliary result. 

Lemma 7. Suppose w £ W is special of order t. Then it is possible to 
choose a lifting w* of w in W* which satisfies 

(2*) (w*Y G T 2 n C(G) 
and 

(4*)«/(C W r.K))>E t | (i( « / (d i ). 

Proof. Let P be an £-Sylow of CV(w); the groups P and (if) commute, and 
P n (w) = 1 since to has order i and £ is prime to t (since £ = 1 modi). 
Hence the group generated by iu and P is the direct product P x (w). 
Since £ ^ 2, its 2-Sylow subgroup is contained in (w). Put C2 = T 2 H C(G). 
We have an exact sequence: 

1 -> T 2 /C 2 -> W*/C 2 ->■ W ->■ 1. 

By property (2) of w, this exact sequence splits over (w), hence over the 
2-Sylow of P w ; since the order of T 2 /C 2 is a power of 2, this implies that 
it splits over P w . We thus get an element w' of W* /C2, of order t, which 
lifts w, and centralizes a subgroup P' of !v"*/C 2 isomorphic to P. We then 
choose for w* a representative of u>' in W*; it has property (2*), moreover its 
centralizer contains the inverse image of P', which is canonically isomorphic 
to C 2 x P' . By property (4) we have 

v l {P')=v i {P)>Y J Vl{di)- 
t\di 

This shows that w* has property (4*). □ 

Remark. In the case where w is regular, one can do without lemma [71 
Indeed the braid group construction of [B M 97j gives a lifting w* of w having 
property (2*) and such that the map Cw*(w*) — > CV(fw) is surjective. 

11.4. The main result. 

Proposition 5. Suppose W contains an element w which is special with 
respect to t and £. Then there exist an inner twist G v of G (cf. §10) and 
a finite £-subgroup A of G v (k) such that the pair (G V ,A) is optimal in the 
sense of §7. 
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(In particular, th.9 is true for (k,£,R).) 

Proof. As in §9, we put K = fc(^), where is a root of unity of order I. 
Let Ct = Gal(K/k); it is a cyclic group of order t. 

Choose w* £ W* with the properties of lemma [7] and let cr be the inner 
automorphism of G defined by w* . Since a has order t, there exists an 
injective homomorphism: 

if : C t -> G &d (k) C Aut fc (G) 

which maps Ct onto the subgroup (a) of Autfc(C) generated by a. As ex- 
plained in §10, we may then define the ip -twist G v of G, relatively to the 
Galois extension K/k. The group G^ is an inner form of G; it has the same 
root system R. It remains to construct a finite ^-subgroup A of G v {k) such 
that (Gtp,A) is optimal, i.e. vi(A) = ma(t) + Yltidi v t(di)i cf- (11-2.2). 

We take for A the semi-direct product E m ■ P, with E m C T v (k) and 
P C Nu,(k), where E m and P are defined as follows: 

(1) P is an £-Sylow of Cw* (w*). By lemma[7|we have v^{P) > Ylt\di v e{^i)- 
Note that the points of P are fixed by a. Hence these points are rational 
over k not only in the group G but also in the group G v . 

(2) E m is the subgroup of T v {k) made up of the elements x such that 
x = 1. 

It is clear that P normalizes E m , and that P n £/ m = 1. 

Lemma 8. T/ie group E m contains a product of a{t) copies of the group 
Z/£ m Z. 

This implies that vi(E m ) > ma(t) and hence 

v e (A) = v e (E m ) + v e (P) > ma(t) + v e (di). 

t\di 

We thus get vg(A) > M and since M is an upper bound for vi(A) we have 
Vi(A) = M. □ 

Proof of lemma 8. Consider first the subgroup T^m of T(k s ) made up of 
the elements x with x 1 ™ = 1. Since T is /c-split, and K = k(zi) = k{z(m) 
(cf. §4 and §9), the points of T^m are rational over K. If we write T(m{K) 
additively, it becomes a free Z/£ m Z-module of rank r and the action of a 
generator s of Ct is by x i— > sx, where s is identified with an element of order 
t in Z| (i.e. s ="^t" with our usual notation for roots of unity). As for the 
action of w* (i.e. of w) on Tgm(K), it can be put in diagonal form since w is 
of order t and t divides I — 1; its diagonal elements are r elements yi, . . . , y r 
of Z/£ m Z, with y\ = \. Let c be the largest integer such that (&t) c divides 
the characteristic polynomial of w. By property (3) of 11.2, we have c > a(t) 
(in fact, c = a(t), by |Sp 74| , th. 3.4). This implies that the family of the 
UiS contains c times each primitive i-th root of unity (viewed as element of 
(Z/e m Z)*). In particular, there is a Z/^Z-submodule X of T t m{K) which 
is free of rank c and on which w acts by x i— > 2^ 1 a;. If we twist C7, T, T^m by 
the new action of Ct = Gdl(K/k) on X is trivial (cf. end of §10). This 
means that X is contained in T^^k), hence in E m , which proves the lemma. 

□ 
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Note the following consequence of proposition 4: 

Corollary. If W contains a t-regular element in the sense of |Sp 74| ; then 
theorem 9 is true for k, £, R. 

In the case t = 1, no twist is necessary (one takes w = 1, w* = 1, cf. 
§11.2). 



§12. Proof of theorem 9 for exceptional groups 

In each case we will show that the Weyl group contains an element w 
which is special with respect to t and £, so that we may apply prop. 5. 

12.1. The case of G2. The degrees di are d\ = 2,^2 = 6. Since t divides 
one of them, t is a divisor of 6, hence is regular ( |Sp 74| , no. 5.4). We may 
then apply prop. 5. □ 

Explicit description of w, w*: if c is a Coxeter element of W, c is of order 
6, and every lifting c* of c in W* has order 6. Hence, for any divisor t of 6, 
we may take w = c 6//t and w* = (c*) 6// *. 

12.2. The case of F4. The d^s are: 2, 6, 8, 12. All their divisors are regular 
(Springer, loc. cit.). One concludes as for G2. □ 

12.3. The case of E 6 . The dj's are: 2, 5, 6, 8, 9, 12. All their divisors are 
regular, except t = 5. In that case, choose any element w 6 W of order 5. 
Since the kernel of W* — > W is a 2-group, w can be lifted to an element w* 
of W* of order 5. Conditions (1) and (2) of §11.2 are obviously satisfied. 
The same is true for condition (3), since o(5) = 1 (only one of the dj's is 
divisible by 5), and w has at least one eigenvalue of order 5. As for condition 
(4), it is trivial, since £ = 1 (mod 5) implies £ > 11, and £ does not divide 
any of the d^s, so that Ylt\di v i(di) is 0. Hence w is special with respect to 
(5,£). □ 

12.4. The case of E 7 . The dja are: 2, 6, 8, 10, 12, 14, 18. By |Sp 74| , 
loc. cit. all their divisors are regular except 4, 5, 8, 10, 12. If t = 4, 5, 8 or 
12, t already occurs for E$, with the same values of a(t), namely 2, 1, 1 and 
1. Hence, we have ^-special elements ^4,105,^8 and W12 in W(Eq). One 
then takes their images in W(-EV) by the injective map W(Eq) — > W(Ej), 
and one checks that they are ^-special (here again condition (4) is trivial 
since ve(di) = for all the ts with £ = 1 (modi)). 

As for t = 10, one takes w = —W5, which makes sense since — 1 £ W. 
The element —1 (usually denoted by Wq) can be lifted to a central element 
e of W* with e 2 € T2 nC(G); this is a general property of the case —l£W 
(which reflects the fact that —1 is 2-regular, see end of §11.2). Hence, if w§ 
is a lifting of W5 of order 5, £w§ is a lifting of w of order 10, and this shows 
that w is special with respect to 10 and I. □ 

12.5. The case of E 8 . The dj's are: 2, 8, 12, 14, 18, 20, 24, 30. By |Sp 74| , 
loc. cit., all their divisors are regular except 7, 9, 14, 18. 

If t = 7 (resp. 9), one chooses w-j G W of order 7 (resp. tug G of order 
9). Since 7 and 9 are odd, condition (2) of §11.2 is satisfied. The same is true 
for condition (3) because a{t) = 1, and for condition (4) because vi(di) = 
for all i. 
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If t = 14 (resp. 18), one takes w = —wj (resp. w = — wg), as we did for 
E 7 . □ 

§13. Proof of theorems 10 and 11 

Here £ = 2. There are three cases (cf. §4.2): 

(a) Imx 2 oc = 1 + 2 m 7i 2 with m > 2. In that case the M-bound is 
rm + V2(W), and th.10 asserts that an optimal pair (G,A) exists for ev- 
ery type R. 

(b) Imx 2 oc = (-l + 2 m ), with m > 2. The M-bound is r m + n + v 2 (W), 
where ro (resp. r{) is the number of i's such that d, is odd (resp. even). 
Here, too, th.10 asserts that an optimal pair exists. 

(c) Imx 200 = (-1, 1 + 2 m ), with m > 2. 

The M-bound is the same as in case (b), but th.ll does not claim that 
it can be met (i.e. that an optimal pair exists); it merely says that there is 
a pair (G, A) with v 2 (A) = r§m + v 2 (W); such a pair is optimal only when 
r\ = 0, i.e. when —1 belongs to the Weyl group. 

13.1. Proof of theorem 10 in case (a). We take G split and simply 
connected, and we choose a maximal split torus T. We use the notation 
(N,W,W*) of §11. Let E be the 2-torsion subgroup of T(k). Since T is 
isomorphic to the product of r copies of G m , E is isomorphic to a product 
of m copies of Z/2 m Z, cf. §4.2. Hence v 2 (E) = rm. The group E is 
normalized by the Tits group W*; we define A as A = E-W*. The exact 
sequence 

1 ->■ E -> A^W 1 
shows that v 2 {A) = rm + v 2 (W). Hence (G, A) is optimal. 

13.2. Cases (b) and (c). As in §11.4, we start with a split G, with a split 
maximal torus T. We define N, W, W* as usual. After choosing an order on 
the root system R, we may view W as a Coxeter group; let u>q be its longest 
element. It has order 2, and it is regular in the sense of Springer |Sp 74| . As 
explained in §11.2, this implies that there is a lifting Wq of wq in W* which 
has the following two properties: 

(i) its square belongs to the center of G; 

(ii) the natural map Cw*(w;q) — > C\y(wo) is surjective. 

Let a be the inner automorphism of G defined by Wq. By (i), we have a 2 = 1. 
Let K = k(i) and let <p be the homomorphism of G&l(K/k) into Aut^(G) 
whose image is {l,cr}. Let us define G^ as the ip-twist of G, in the sense 
defined in §10. Denote by T v , and W* the (^-twists of T, N and W*. We 
have an exact sequence 

1 -> T v -> N v -> W v -»• 1, 

where Wm is the p-twist of W. Note that W ip (k) is equal to the centralizer 
C\y(wo) of wq in W, and similarly W*(k) is equal to C w , (wq). 

As in §13.1, let E be the 2-torsion subgroup of T^k). It is normalized by 
C w , (wq). Define A C G v (k) to be the group A = E ■ C w , (wg). By (ii), we 
have an exact sequence: 

1 -> S ->■ A -> C^(^o) -> 1 , 
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which shows that v 2 (A) = V2(E) + ^(^(wo))- The fact that wq is regular 
of order 2 implies that 

\Cw(w )\ = JJ<ij, 

2\di 

hence v 2 (Cw{wq)) = ^ v 2 (di) = v 2 (W) . This gives: 

(13.2.1) v 2 (A) = v 2 (E) + v 2 (W). 

Proposition 6. We have : 

v 2 (E) = n + rom in case (b) 
v 2 (E) = rom in case (c). 

In case (b), this shows that (G V ,A) is optimal, which proves th.10. Sim- 
ilarly, the fact that v 2 (A) = rom + v 2 (W) proves th.ll in case (c). 

13.3. Proof of proposition 6. We need to describe explicitly the torus 
T<p. To do so, let us first define the following two tori: 

G G m = 1-dimensional torus deduced from G m by Galois twist relatively 
to K/k. Its group of fc-points is K\ = KerA^^ : K* — >■ k*. 

R K i k G m = 2-dimensional torus deduced from G m by Weil's restriction 
of scalars relatively to K/k. Its group of fc-points is K*. 

Lemma 9. The torus T v is isomorphic to the product of r\ copies of 
R K i k G m and ro — r\ copies of G^. 

Proof. The character group X = Hom(T, G rn ) is free of rank r, with basis 
the fundamental weights u)i,...,u) r . This gives a decomposition of T as 

T = Ti x T 2 x • • • x T r , 

where each T« is canonically isomorphic to G m . Let r = — wq be the oppo- 
sition involution of the root system R; it permutes u±, . . . ,co r with r\ orbits 
of order 2, and ro — r\ orbits of order 1. (This follows from the fact that 
— 1 is an eigenvalue of wq of multiplicity ro.) The involution r permutes the 
tori Tj. If an index j is fixed by r, then ^o acts on Tj by t t^ 1 and the 
twisted torus (Tj)^ is isomorphic to G^; similarly, if r permutes j and f, 
the torus (Tj x Tj/)^ is isomorphic to R K / k G m . This proves lemma 9. □ 

End of the proof of prop. 6. The 2-torsion subgroup of G^(fc) = K\ is cyclic 
of order 2 m ; the 2-torsion subgroup of Rx/kG m (k) = K* is cyclic of order 
2 m+1 in case (b) and of order 2 m in case (c). We get what we wanted, 
namely: 

case (b): v 2 (E) = r\(m + 1) + (ro — r\)m = r$m + r\ 
case (c): v 2 (E) = r\m + (ro — r{)m = rom. 
This completes the proof of prop. 6, and hence of th.10 and th.ll. □ 

13.4. Remarks on the non simply connected case. The proof above 
could have been given without assuming that the split group G is simply 
connected. The main difference is in lemma 9: in the general case, the torus 
T v is a product of three factors (instead of two): 

T v = (G m ) a x (G^ x (R K/k G m y , 
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where a, /3, 7 are integers, with /3 + 7 = ro and a + 7 = ri. This gives the 
following formulae for V2(E) : 

case (b) : V2(E) = a + /3m + j(m + 1) = r± + r^m 

case (c) : V2(E) = a + /3m + 7m = a + rom . 
In case (b) one finds the same value for V2(A), namely the M-bound. 
In case (c) one finds a result which is intermediate between the M-bound 
r\ + rom + V2(W) and the value rom + V2(W) given by th.ll. 

Examples (assuming we are in case (c)). 

- Type A r , r even. One finds that a is always 0, so that one does not gain 
anything by choosing non simply connected groups. Indeed, in that case, it 
is possible to prove, by a variant of Schur's method, that the value of V2(A) 
given by th.ll is best possible. 

- Type A r , r odd > 3. Here r\ = (r — l)/2. One finds that a = if r = 1 
(mod 4), but that a can be equal to 1 if r = 3 (mod 4). When r = 3, we 
thus get a = r\\ this shows that the M-bound is best possible for type A3. 

- Type D r , r odd. Here r± = 1, and if one chooses G neither simply 
connected nor adjoint, one has a = 1. This means that the orthogonal 
group S02r has an inner fc-form which contains an optimal A. (Note the 
case r = 3, where D3 = A3.) 

- Type Eq. Here r\ = 2, and one has a = both for the simply connected 
group and for the adjoint group (indeed, a is for every adjoint group). 
I do not know whether the bound of th.ll is best possible in this case. 

§14. The case m = 00 

14.1. Statements. We keep the notation (G, R, W, di,£, t, m) of §4 and §6; 
as before, we assume that G is of inner type. 

We consider the case m = 00, i.e. the case where the image of Xi°° JS 
finite; that image is then cyclic of order t, cf. §4. 

Let a(t) be the number of i's such that di = (mod t). If a(t) = 0, then 
G(k) is ^-torsion free, cf. §6.2, cor. to prop. 4. In what follows, we shall thus 
assume that a(t) > 1. In that case, G(k) may contain infinite ^-subgroups 
(we say that a group is an i-group if every element of that group has order 
a power of £). The following two theorems show that a(t) controls the size 
of such a subgroup: 

Theorem 12. Let A be an l-subgroup of G(k). Then A contains a subgroup 
of finite index isomorphic to the i-group (Q^/Z^) a ; with a < a(t). 

(Note that Q^/Z^ is the union of an increasing sequence of cyclic groups 
of order £, £ 2 , . . . ; it is the analogue of Z/£ m Z for m = 00.) 

The bound a < a(t) of th.12. is optimal. More precisely: 

Theorem 13. There exist a semisimple group G of inner type, with root 
system R, and an £- subgroup A of G(k), such that A is isomorphic to the 
product of a(t) copies o/Q^/Z^. 

14.2. Proof of theorem 12. We need a few lemmas: 
Lemma 10. Any finitely generated £-subgroup of G(k) is finite. 
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Proof. Let B be a finitely generated ^-subgroup of G(k). We may embed B 
in GL n (k) for n large enough. By a known result (see §1.2) there exists a 
subgroup B' of B, of finite index, which is torsion-free if char(fc) = 0, and 
has only p-torsion if char(fc) = p. Since B' is an £-group, this means that 
B' = 1, hence -B is finite. □ 

Lemma 11. There exists a maximal k-torus of G which is normalized by 
A. (Recall that A is an ^-subgroup of G(k).) 

Proof. Let F be the set of all finite subgroups of A, ordered by inclusion. 
Lemma 10 implies that, if B\ and B2 belong to F, so does {B\, B2). Let X 
be the /c-variety parametrizing the maximal tori of G; it is a homogeneous 
space of G. If B G F, let X B be the subvariety of X fixed by B; a point 
of X B corresponds to a maximal torus of G normalized by B. By the 
noetherian property of the scheme X, one may choose Bq € F such that 
X is minimal among the X B 's. If B e F, then x( B °' B *> is contained 
in X B() , hence equal to X B °. This shows that X B ° is contained in all the 
X B 's, i.e. that every maximal torus which is normalized by Bq is normalized 
by all the B 7 s, hence by A. By the corollary to th.3" of §3.3 (applied to the 
finite £-group Bq) there exists such a torus which is defined over k. □ 

Lemma 12. Let u G M r (Z^) be anrxr matrix with coefficients in Z^, which 
we view as an endomorphism o/(Q^/Z^) r . Then Ker(u) has a subgroup of 
finite index isomorphic to the product of r — rank(it) copies of Qe/^e- 

In other words, the "corank" of Ker(ii) is equal to r — rank(n). 

Proof. Same as that of lemma 4 of §5.2: by reduction to the case where u 
is a diagonal matrix. □ 

Lemma 13. Let zt be a primitive t-th root of unity, and let w be an element 
ofW. The multiplicity of z t as an eigenvalue of w is < a(t). 

Proof. See |Sp 74| , th.3.4(i) where it is deduced from the fact that the 
polynomial det(t — w) divides IJi(* ~~ ■*-)• ^ 

Lemma 14. Let T be a maximal k-torus of G, and let T(k)i be the l-torsion 
subgroup ofT{k). We have corank T{k)z < a(t). 

As above, the "corank" of a commutative ^-group is the largest n such 
that the group contains the product of n copies of Q^/Z^. 

Proof. As in §5.2, let Y(T) = Hom/% s (G m ,T) be the group of cocharacters 
of T. The action of the Galois group T/% on Y(T) gives a homomorphism 

p : T fc -> Aut Y(T) ~ GL r (Z) 

and the image of p is contained in the Weyl group W (this is still another way 
of saying that G is of inner type). The group acts on T(k s )e ~ (Qe/Zi£) r 
by p ® Xi where x = X e °c ■ Let us now choose g G Tfc such that = z^ 1 , 
where Zt is an element of order t of Z|, and let w = p{g). The element g 
acts on T(k s )i by wz^ . Let T g be the kernel of g — 1 on T(k s )f . By lemma 
12, we have corank (T g ) = r— rank(<? — 1), which is equal to the multiplicity 
of Zt as an eigenvalue of w; using lemma 13, we get corank(T 9 ) < a(t), and 
since T(k)g is contained in T g , we have corank(T(/s)£) < a(t). □ 
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End of the proof of th.12. By lemma 11, there is a maximal /c-torus T of 
G which is normalized by A. Let A° = A n T(k). Then A° is an abelian 
subgroup of A of finite index. Since ^4° is contained in T{k)^, lemma 14 
shows that A° is isomorphic to the product of a finite group with a product 
of at most a(t) copies of Q^/Z^. □ 

14.3. Proof of theorem 13. We follow the same strategy as for theorem 
9, 10 and 11. There are three cases: 

14.3.1. Classical groups (I / 2). We change slightly the definitions of 
§9.1: we define An as the subgroup of G~Ln(K), with K = k(zi), made up 
of the diagonal matrices of order a power of £; it is isomorphic to (Q^/Z^)^. 

For any given n > 2, we put N = [n/t] and we get embeddings 

An -> GL N (K) -> GLjvt(fc) -> GL n (fc). 

If t > 1, one checks that the /c-determinant of every element of An is 1; 
we thus get an embedding An — > SL n (fc) which has the required properties 
since N = a(t) in that case. When t = 1, we replace An by the subgroup of 
its elements of fc-determinant 1, and we also get what we want. This solves 
the case of type A r . Types B r , C r and D r are then treated by the methods 
of §9.3 and §9.4. 

14.3.2. Exceptional groups (1^2). One replaces prop. 5 of §11.4 by a 
statement giving the existence of A C G^k) with A ~ (Q^/Z^)°^. The 
proof is the same. One then proceeds as in §12. 



14.3.3. The case 1 = 2. Same method as in §13. 



□ 
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